PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 
C12N 1/20, 9/04, 15/00 



Al 



(11) Internaaonal Publication Number: WO 00/46346 

(43) International Publication Date: 10 August 2000 (10.08,00) 



(21) International Application Number: PCT/US0Q/02I85 

(22) International Filing Date: 27 January 2000 (27.01.00) 



(30) Priority Data: 
60/118,349 



3 February 1999 (03.02.99) 



US 



(71) Applicant (for all designated States except US): WASHING- 

TON STATE UNIVERSITY RESEARCH FOUNDATION 
[US/US]; N.E. 1615 Eastgatc Boulevard, Pullman, WA 
99164-1802 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): CROTEAU, Rodney, B. 
[US/US]; 1835 N.E Valley Road, Pullman, WA 99163 
(US). LANGE, Bemd, M. [DE/US]; 345 N.W. Irving Street 
#4, Pullman, WA 99163 (US). 

(74) Agent: MCGURL, Barry, F.; Christensen O'Connor Johnson 
& Kindness PLLC, Suite 2800, 1420 Fifth Avenue, Seattle, 
WA 98101 (US). 



(81) Designated States: AE, AL, AM, AT. AU. AZ, BA BB BG 
BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, EE,' 
ES, FI, GB, GD, GE. GH. GM, HR, HU, ID, IL, IN, IS JP 
KE, KG, KP, KR, KZ, LC, LK, LR. LS, LT. LU, LV, MA, 
MD, MG, MK. MN, MW, MX, NO, NZ, PL, PT RO RU 
SD, SE, SG, SI. SK, SL, TJ, TM, TR, TT, TZ, UA,' UG* 
US, UZ, VN, YU, ZA, ZW, ARIPO patent (GH. GM, KE, 
LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM 
AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent (AT 
BE, CH, CY, DE, DK, ES, FI, FR, GB, GR. IE. IT. Lu| 
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM 
GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published 

With international search report. 



(54) Title: 1 -DEOX Y-D-XYLULOSEr-5-PHOSPHATE R ED UCTO ISOMER AS ES, AND METHODS OF USE 
(57) Abstract 

The present invention relates to isolated DNA sequences which code for the expression of plant l^eoxy-I>-xyIulose-5-phosphate 
reductiosomerase protein, such as the sequence presented in SEQ ID NO:l which encodes a l^deoxy-D-xylulose-5-phosphate re- 
ductoisomerase protein from peppermint (Mentha x piperita). Additionally, the present invention relates to isolated plant l^de- 
oxy r D_ X y] u i OS e_5_phosphate reductoisomerase protein. In other aspects, the present invention is directed to replicable recombinant clonine 
vehicles comprising a nucleic acid sequence which codes for a plant I-deoxy-D-xylulose-5-phosphate reductoisomerase, to modified host 
cells transformed, transfected, infected and/or injected with a recombinant cloning vehicle and/or DNA sequence of the invention 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


AM 


Armenia 


FT 


Finland 


LT 


AT 


Austria 


FR 


France 


LU 


AU 


Australia 


GA 


Gabon 


LV 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


BB 


Barbados 


CH 


Ghana 


MG 


BE 


Belgium 


CN 


Guinea 


MK 


BF 


Burkina Faso 


GR 


Greece 




BG 


Bulgaria 


HU 


Hungary 


ML 


Bj 


Benin 


IE 


Ireland 


MN 


BR 


Brazil 


IL 


Israel 


MR 


BY 


Belarus 


IS 


Iceland 


MW 


CA 


Canada 


IT 


Italy 


MX 


CF 


Central African Republic 


JP 


Japan 


NE 


CC 


Congo 


KE 


Kenya 


NL 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


a 


Cote dlvoire 


KP 


Democratic People's 


NZ 


CM 


Cameroon 




Republic of Korea 


PL 


CN 


China 


KR 


Republic of Korea 


PT 


cu 


Cuba 


KZ 


Kazakstan 


RO 


cz 


Czech Republic 


LC 


Saint Lucia 


RU 


DE 


Germany 


U 


Liechtenstein 


SD 


DK 


Denmark 


LK 


Sri Lanka 


SE 


EE 


Estonia 


LR 


Liberia 


SC 



Lesotho 


SI 


Slovenia 


Lithuania 


SK 


Slovakia 


Luxembourg 


SN 


Senegal 


Latvia 


sz 


Swaziland 


Monaco 


TD 


Chad 


Republic of Moldova 


TC 


Togo 


Madagascar 


TJ 


Tajikistan 


The former Yugoslav 


TM 


Turkmen isi an 


Republic of Macedonia 


TR 


Turkey 


Mali 


TT 


Trinidad and Tobago 


Mongolia 


UA 


Ukraine 


Mauritania 


uc 


Uganda 


Malawi 


us 


United Stales of America 


Mexico 


uz 


Uzbekistan 


Niger 


VN 


Viet Nam 


Netherlands 


YU 


Yugoslavia 


Norway 


zw 


Zimbabwe 



New Zealand 
Poland 
Ponugat 
Romania 

Russian Federation 

Sudan 

Sweden 

Singapore 



WO 00/46346 



-1- 



PCT/US00/02185 



l-DEOXY-D-XYLULOSE-5-PHOSPHATE REDUCTOISOMERASES, AND 

METHODS OF USE 

Field of the Invention 

5 This invention relates to nucleic acid sequences encoding 1-deoxy-D- 

xylulose-5-phosphate reductoisomerase. 

Background of the Invention 
Isoprenoids are a large and structurally diverse group of compounds that play 
essential roles in plants as hormones, photosynthetic pigments, electron carriers, and 

10 components of membranes, and that also serve in communication and defense 
(Harborne, J.B. (1991) in Ecological Chemistry and Biochemistry of Plant 
Terpenoids (Harborne, J.B., and Tomas-Barberan, R.A., Eds.), pp. 399-426. 
Clarendon Press, Oxford). Until recently, it was widely accepted that all isoprenoids 
were synthesized via the acetate/mevalonate pathway (Spurgeon. S.L., and Porter, 

15 J.W. (1983) in Biosynthesis of Isoprenoid Compounds (Porter, J.W., and Spurgeon, 
S.L., Eds.), Vol. 1 , pp. 1 -46, John Wiley, New York). 

However, evidence has emerged over the last few years that isopentenyl 
diphosphate, the central intermediate of isoprenoid biosynthesis, originates from 
pyruvate and D-glyceraldehyde-3-phosphate via a new mevalonate-independent 

20 pathway in several eubacteria (Rohmer, M., et al., Biochem. J. 295, 517-524 (1993); 
Broers, S.T.J. (1994) Ph.D. Thesis, Eidgenossische Technische Hochschule, Zurich, 
Switzerland; Rohmer, M., et al., J. Am. Chem. Soc. 118, 2564-2566 (1996)), algae 
(Schwender, J., et al., Biochem. J. 316, 73-80 (1996) ), and plant plastids (Schwarz, 
M.K. (1994) Ph.D. Thesis, Eidgenossische Technische Hochschule, Zurich, 
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Switzerland; Lichtenthaler, H.K., et al., FEBS Lett. 400, 271-274 (1997)). The first 
step in this novel pathway involves a transketoiase-type condensation reaction of 
pyruvate and glyceraIdehyde-3-phosphate to yield l-deoxy-D-xylulose-5-phosphate 
(FIGURE 1). Genes encoding the enzyme which catalyzes this reaction, 
deoxyxylulose phosphate synthase, have been cloned from E. coli (Sprenger, G.A., et 
al., Proc. Natl Acad. Sci. USA 94, 12857-12862 (1997); Lois, L.M. et al., Proc. Natl. 
Acad Sci. USA 95, 2105-21 10 (1998)), peppermint (Mentha x piperita) (Lange, B.M. 
et al., Proc. Natl Acad ScL USA 95, 2100-2104 (1998)) and pepper (Bouvier, F. et 
al, Plant Physiol. 117,1423-1431 (1998)). 

The second step of the mevalonate-independent pathway is considered to 
involve an intramolecular rearrangement and subsequent reduction of deoxyxylulose 
phosphate to yield 2-C-methyl-D-eiythritol-4-phosphate (Duvold, T. et al., 
Tetrahedron Lett. 38, 4769-4772 (1997); Duvold, T. et al., Tetrahedron Lett. 3S, 
6181-6184 (1997); Sagner, S. et al., Tetrahedron Lett. 39, 2091-2094 (1998)) 
(FIGURE 1). Seto and coworkers (Takahashi, S. et al., Proc. Natl. Acad. ScL USA 
95, 9879-9884 (1998)) have recently reported the isolation and characterization of a 
reductoisomerase gene from E. coli. The present invention provides a nucleic acid 
molecule isolated from peppermint that encodes a l-deoxy-D-xylulose-5-phosphate 
reductoisomerase. 

Summary of the Invention 
In accordance with the foregoing, a cDNA encoding a 
l-deoxy-D-xylulose-5-phosphate reductoisomerase from peppermint (Mentha 
piperita) has been isolated and sequenced, and the corresponding amino acid 
sequence has been deduced. Accordingly, the present invention relates to isolated 
DNA sequences which code for the expression of plant 
l-deoxy-D-xylulose-5-phosphate reductoisomerase, such as isolated DNA sequences 
which code for the expression of l-deoxy-D-xylulose-5-phosphate reductoisomerase 
from essential oil plants, including plants of the genus Mentha. A representative 
example of an isolated, Mentha DNA sequence which codes for the expression of 
l-deoxy-D-xylulose-5-phosphate reductoisomerase is set forth in SEQ ID NO:l 
which encodes a l-deoxy-D-xylulose-5-phosphate reductoisomerase protein (SEQ ID 
NO:2) from peppermint (Mentha piperita). Additionally, the present invention 
relates to isolated plant l-deoxy-D-xylulose-5-phosphate reductoisomerase proteins 
(including isolated l-deoxy-D-xylulose-5-phosphate reductoisomerase proteins from 
essential oil plants, such as plants of the genus Mentha), including the peppermint 



WO 00/46346 



PCT/USOO/02185 



(Mentha piperita) l-deoxy-D-xylulose-5-phosphate reductoisomerase protein having 
the amino acid sequence set forth in SEQ ID NO:2. 

In another aspect, the present invention relates to nucleic acid molecules that 
hybridize under stringent conditions to the nucleic acid molecule having the sequence 
5 set forth in SEQ ID NO:l, or to its complement, ie. y to an antisense molecule that is 
complementary in sequence to the sequence set forth in SEQ ID NO:l. In other 
aspects, the present invention is directed to replicable recombinant cloning vehicles 
comprising a nucleic acid sequence, e.g., a DNA sequence which codes for a plant 
l-deoxy-D-xylulose-5 -phosphate reductoisomerase, or for a nucleotide sequence 

10 sufficiently complementary to at least a portion of DNA or RNA encoding a plant 
l-deoxy-D-xylulose-5-phosphate reductoisomerase to enable hybridization therewith 
(e.g., antisense RNA or fragments of DNA complementary to a portion of DNA or 
RNA molecules encoding a plant l-deoxy-D-xylulose-5-phosphate reductoisomerase 
which are useful as polymerase chain reaction primers or as probes for plant 

15 l-deoxy-D-xylulose-5-phosphate reductoisomerase genes or related genes). In yet 
other aspects of the invention, modified host cells are provided that have been 
transformed, transfected, infected and/or injected with a recombinant cloning vehicle 
and/or DNA sequence of the invention. 

Thus, the present invention provides for the recombinant expression of plant 

20 l-deoxy-D-xylulose-5-phosphate reductoisomerase, and the inventive concepts may 
be used to facilitate the production, isolation and purification of significant quantities 
of recombinant l-deoxy-D-xylulose-5-phosphate reductoisomerase (or of its primary 
enzyme products) for subsequent use, to obtain expression or enhanced expression of 
l-deoxy-D-xylulose-5-phosphate reductoisomerase in plants, microorganisms or 

25 animals, or may be otherwise employed in an environment where the regulation or 
expression of l-deoxy-D-xylulose-5-phosphate reductoisomerase is desired for the 
production of this enzyme, or its enzyme product, or derivatives thereof. 

Brief Description of the Drawing s 
The foregoing aspects and many of the attendant advantages of this invention 

30 will become more readily appreciated as the same becomes better understood by 
reference to the following detailed description, when taken in conjunction with the 
accompanying drawings, wherein: 

FIGURE 1 shows an outline of the pyruvate/glyceraldehyde-3-phosphate 
pathway for the biosynthesis of isopentenyl diphosphate, and proposed reaction 

35 mechanism of the l-deoxy-D-xylulose-5-phosphate reductoisomerase in the 
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conversion of l-deoxy-D-xylulose-5-phosphate to 2-C-methyl-D- 
erythritol-4-phosphate. The circled P denotes the phosphate moiety. The broken 
arrow indicates several as yet unidentified steps. 

FIGURE 2 shows GC-MS analysis of (A) the trimethylsilyl ether derivative 
of the dephosphorylated biosynthetic product (Rj = 7.1±0.1min) generated by 
recombinant peppermint 1 -deoxy-D-xylulose-5-phosphate reductoisomerase (SEQ ID 
NO:2), and (B) the trimethylsilyl ether derivative of authentic 2-C-methyl-D,L- 
erythritol (R { - 7.1 ± 0.1 min) identically prepared. 

Detailed Description of the Preferred Embodiment 

As used herein, the terms "amino acid" and "amino acids" refer to all 
naturally occurring L-a-amino acids or their residues. The amino acids are identified 
by either the single-letter or three-letter designations: 



Asp 


D 


aspartic acid 


He 


I 


isoleucine 


Thr 


T 


threonine 


Leu 


L 


leucine 


Ser 


S 


serine 


Tyr 


Y 


tyrosine 


Glu 


E 


glutamic acid 


Phe 


F 


phenylalanine 


Pro 


P 


proline 


His 


H 


histidine 


Gly 


G 


glycine 


Lys 


K 


lysine 


Ala 


A 


alanine 


Arg 


R 


arginine 


Cys 


C 


cysteine 


Trp 


W 


tryptophan 


Val 


V 


valine 


Gin 


Q 


glutamine 


Met 


M 


methionine 


Asn 


N 


asparagine 



As used herein, the term "nucleotide" means a monomeric unit of DNA or 
RNA containing a sugar moiety (pentose), a phosphate and a nitrogenous 
heterocyclic base. The base is linked to the sugar moiety via the glycosidic carbon 
(1* carbon of pentose) and that combination of base and sugar is called a nucleoside. 
The base characterizes the nucleotide with the four bases of DNA being adenine 
("A"), guanine ("G"), cytosine ("C") and thymine ("T"). Inosine ("I") is a synthetic 
base that can be used to substitute for any of the four, naturally-occurring bases (A, 
C, G or T). The four RNA bases are A,G,C and uracil ("U"). The nucleotide 
sequences described herein comprise a linear array of nucleotides connected by 
phosphodiester bonds between the 3' and 5' carbons of adjacent pentoses. 

"Oligonucleotide" refers to short length single or double stranded sequences 
of deoxyribonucleotides linked via phosphodiester bonds. The oligonucleotides are 
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chemically synthesized by known methods and purified, for example, on 

polyacrylamide gels. 

The term "l-deoxy-D-xylulose-5-phosphate reductoisomerase" is used herein 

to mean an enzyme capable of forming 2-C-methyI-D-erythritol-4-phosphate from 
5 1 -deoxy-D-xylulose-5-phosphate. 

The term "hybridize under stringent conditions", and grammatical equivalents 

thereof, means that a nucleic acid molecule that has hybridized to a target nucleic 

acid molecule immobilized on a DNA or RNA blot (such as a Southern blot or 

Northern blot) remains hybridized to the immobilized target molecule on the blot 
10 during washing of the blot under stringent conditions. In this context, exemplary 

hybridization conditions are: hybridization in 5 X SSC at 65°C for 16 hours. 

Exemplary high stringency wash conditions are two washes in 2 X SSC at 23°C for 

20 minutes per wash, followed by one wash in 2.0 X SSC at 50°C for 30 minutes. 

Exemplary very high stringency wash conditions are two washes in 2 X SSC at 23°C 
15 for 15 minutes per wash, followed by two washes in 1.0 X SSC at 60°C for 20 

minutes. 

The abbreviation "SSC" refers to a buffer used in nucleic acid hybridization 
solutions. One liter of the 20X (twenty times concentrate) stock SSC buffer solution 
(pH 7.0) contains 175.3 g sodium chloride and 88.2 g sodium citrate. 

20 The term "essential oil plant," or "essential oil plants," refers to a group of 

plant species that produce high levels of monoterpenoid and/or sesquiterpenoid 
and/or diterpenoid oils, and/or high levels of monoterpenoid and/or sesquiterpenoid 
and/or diterpenoid resins. The foregoing oils and/or resins account for greater than 
about 0.005% of the fresh weight of an essential oil plant that produces them. The 

25 essential oils and/or resins are more fully described, for example, in E. Guenther, The 
Essential Oils, Vols. I-VI, R.E. Krieger Publishing Co., Huntington N.Y., 1975, 
incorporated herein by reference. The essential oil plants include, but are not limited 
to: 

Lamiaceae, including, but not limited to, the following species: Ocimum 
30 (basil), Lavandula (Lavender), Origanum (oregano), Mentha (mint), Salvia (sage), 
Rosmarinus, (rosemary), Thymus (thyme), Satureja (savory), Monarda (balm) and 
Melissa. 

Umbelliferae, including, but not limited to, the following species: Carum 
(caraway), Anethum (dill), foeniculum (fennel) and Daucus (carrot). 
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Asteraceae (Compositae), including, but not limited to, the following species: 
Artemisia (tarragon, sage brush), Tanacetum (tansy). 

Rutaceae (e.g., Citrus plants); Rosaceae (e.g., roses); Myrtaceae (e.g., 
Eucalyptus, Melaleuca); the Gramineae (e.g., Cymbopogon (citronella)); Geranaceae 
(Geranium) and certain conifers including Abies (e.g., Canadian balsam), Cedrus 
(cedar), Thuja, Juniperus, Pinus (pines) and Picea (spruces). 

The range of essential oil plants is more fully set forth in K Guenther, The 
Essential Oils, Vols. I-VI, R.E. Krieger Publishing Co., Huntington N.Y., 1975, 
which is incorporated herein by reference. 

The terms "alteration", "amino acid sequence alteration", "variant" and 
"amino acid sequence variant" refer to l-deoxy-D-xylulose-5-phosphate 
reductoisomerase molecules with some differences in their amino acid sequences as 
compared to the corresponding, native, i.e., naturally-occurring, 
l-deoxy-D-xyluIose-5-phosphate reductoisomerases. Ordinarily, the variants will 
possess at least about 70% homology with the corresponding native 
l-deoxy-D-xyluIose-5-phosphate reductoisomerases, and preferably, they will be at 
least about 80% homologous with the corresponding, native 
l-deoxy-D-xylulose-5-phosphate reductoisomerases. The amino acid sequence 
variants of the l-deoxy-D-xylulose-5-phosphate reductoisomerases falling within this 
invention possess substitutions, deletions, and/or insertions at certain positions. 
Sequence variants of l-deoxy-D-xylulose-5-phosphate reductoisomerases may be 
used to attain desired enhanced or reduced enzymatic activity, modified 
regiochemistry or stereochemistry, or altered substrate utilization or product 
distribution. 

Substitutional l-deoxy-D-xylulose-5-phosphate reductoisomerase variants are 
those that have at least one amino acid residue in the native 
l-deoxy-D-xylulose-5 -phosphate reductoisomerase sequence removed and a different 
amino acid inserted in its place at the same position. The substitutions may be single, 
where only one amino acid in the molecule has been substituted, or they may be 
multiple, where two or more amino acids have been substituted in the same molecule. 
Substantial changes in the activity of the l-deoxy-D-xylulose-5-phosphate 
reductoisomerase molecules of the present invention may be obtained by substituting 
an amino acid with a side chain that is significantly different in charge and/or 
structure from that of the native amino acid. This type of substitution would be 
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expected to affect the structure of the polypeptide backbone and/or the charge or 
hydrophobicity of the molecule in the area of the substitution. 

Moderate changes in the activity of the l-deoxy-D-xyluIose-5-phosphate 
reductoisomerase molecules of the present invention would be expected by 
5 substituting an amino acid with a side chain that is similar in charge and/or structure 
to that of the native molecule. This type of substitution, referred to as a conservative 
substitution, would not be expected to substantially alter either the structure of the 
polypeptide backbone or the charge or hydrophobicity of the molecule in the area of 
the substitution. 

10 Insertional l-deoxy-D-xylulose-5-phosphate reductoisomerase variants are 

those with one or more amino acids inserted immediately adjacent to an amino acid 
at a particular position in the native l-deoxy-D-xylulose-5-phosphate 
reductoisomerase molecule. Immediately adjacent to an amino acid means connected 
to either the ct-carboxy or a-amino functional group of the amino acid. The insertion 

15 may be one or more amino acids. Ordinarily, the insertion will consist of one or two 
conservative amino acids. Amino acids similar in charge and/or structure to the 
amino acids adjacent to the site of insertion are defined as conservative. 
Alternatively, this invention includes insertion of an amino acid with a charge and/or 
structure that is substantially different from the amino ^cids adjacent to the site of 

20 insertion. 

Deletional variants are those where one or more amino acids in the native 
l-deoxy-D-xylulose-5 -phosphate reductoisomerase molecules have been removed. 
Ordinarily, deletional variants will have one or two amino acids deleted in a 
particular region of the l-deoxy-D-xylulose-5-phosphate reductoisomerase molecule. 
25 Deletional variants include those where all or most of the transit sequence has been 
removed. 

The terms "biological activity", "biologically active", "activity" and "active" 
refer to the ability of the l-deoxy-D-xylulose-5-phosphate reductoisomerases of the 
present invention to catalyze the formation of 2-C-methyl-D-erythritol-4-phosphate 
30 by reduction and rearrangement of l-deoxy-D-xylulose-5-phosphate. 1-Deoxy- 
D-xylulose-5-phosphate reductoisomerase activity is measured in an enzyme activity 
assay, such as the assay described in Example 3 herein. Amino acid sequence 
variants of the l-deoxy-D-xylulose-5-phosphate reductoisomerases of the present 
invention may have desirable altered biological activity including, for example, 
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altered reaction kinetics, substrate utilization, product distribution or other 
characteristics such as regiochemistry and stereochemistry. 

The terms "DNA sequence encoding", "DNA encoding" "nucleic acid 
molecule encoding" and "nucleic acid encoding" refer to the order or sequence of 
5 deoxyribonucleotides along a strand of deoxyribonucleic acid. The order of these 
deoxyribonucleotides determines the order of amino acids along the translated 
polypeptide chain. The DNA sequence thus codes for the amino acid sequence. 

The terms "replicable vector" "replicable expression vector" and "expression 
vector" refer to a piece of DNA, usually double-stranded, which may have inserted 

10 into it another piece of DNA (the insert DNA) such as, but not limited to, a cDNA 
molecule. The vector is used to transport the insert DNA into a suitable host cell. 
The insert DNA may be derived from the host cell, or may be derived from a 
different cell or organism. Once in the host cell, the vector can replicate 
independently of or coincidental with the host chromosomal DNA, and several copies 

15 of the vector and its inserted DNA may be generated. The terms "replicable 
expression vector" and "expression vector" refer to replicable vectors that contain the 
necessary elements that permit transcription and translation of the insert DNA into a 
polypeptide. Many molecules of the polypeptide encoded by the insert DNA can thus 
be rapidly synthesized. 

20 The terms "transformed host cell," "transformed" and "transformation" refer 

to the introduction of DNA into a cell. The cell is termed a "host cell", and it may be 
a prokaryotic or a eukaryotic cell. Typical prokaryotic host cells include various 
strains of E. coli. Typical eukaryotic host cells are plant cells, yeast cells, insect cells 
or animal cells. The introduced DNA is usually in the form of a vector containing an 

25 inserted piece of DNA. The introduced DNA sequence may be from the same 
species as the host cell or from a different species from the host cell, or it may be a 
hybrid DNA sequence, containing some foreign DNA and some DNA derived from 
the host species. 

Other abbreviations used are: bp, base pair; GC, gas chromatography; HPLC, 
30 high performance liquid chromatography; IPTG, 

isopropyl-l-thio-P-D-galactopyranoside; kb, kilobase pairs; MS., mass spectrometry; 
Tris, Tris-(hydroxymethyl)aminomethane. 

In accordance with the present invention, cDNAs encoding 1-deoxy- 
D-xylulose-5-phosphate reductoisomerase from Peppermint (Mentha x piperita) were 
35 r isolated and sequenced in the following manner. A cDNA library was constructed 
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from mRNA from isolated peppermint oil gland secretory cells, a cell type highly 
specialized for essential oil biosynthesis. PCR primers were designed (PI, 
S'-CGAGATTATGCCAGGAGAGC^ 1 (SEQ ID NO:3); P2, 
5 -GGCTTC AGGC AAACCCTTG-3 ' and employed with peppermint oil gland 
5 library cDNA as template to amplify a 223 bp fragment designated pMPDXRl (SEQ 
ID NO:5) with some similarity (-50%) to the E. coli reductoisomerase gene. By 
screening the peppermint oil gland cDNA library (2.5 x 10 4 plaques) with a labeled 
probe derived from pMPDXRl (SEQ ID NO:5), five full-length clones were 
obtained, including the cDNA having the nucleic acid sequence set forth in SEQ ID 
10 NO:l. 

Additionally, cDNA molecules encoding l-deoxy-D-xylulose-5-phosphate 
reductoisomerase were isolated from Arabidopsis thaliana in the following manner. 
2 x 10 4 plaques of an A. thaliana flower bud cDNA library (CD4-6 from the 
Arabidopsis Biological Resource Center (http://aims.cps.msu.edu/aims/)) were 

15 screened with pMPDXRl (SEQ ID NO:5) and afforded 20 positive clones, including 
the clone having the sequence set forth in SEQ ID NO:6 encoding the 5'-truncated 
protein having the amino acid sequence set forth in SEQ ID NO:7. 

The full-length peppermint 1 -deoxy-D-xylulose-5-phosphate 
reductoisomerase cDNA (having the sequence set forth in SEQ ID NO: 1) expressed a 

20 functional l-deoxy-D-xyIulose-5-phosphate reductoisomerase protein (SEQ ID NO:2) 
in E. coli, as described in Example 3 herein. 

The isolation of cDNAs encoding l-deoxy-D-xylulose-5-phosphate 
reductoisomerase from peppermint permits development of efficient expression 
systems for this functional enzyme; provides useful tools for examining the 

25 developmental regulation of l-deoxy-D-xylulose-5-phosphate reductoisomerase; 
permits investigation of the reaction mechanism(s) of this enzyme, and permits the 
isolation of other l-deoxy-D-xylulose-5 -phosphate reductoisomerases, such as other 
plant l-deoxy-D-xylulose-5-phosphate reductoisomerases. The isolation of 1-deoxy- 
D-xylulose-5-phosphate reductoisomerase cDNAs also permits the transformation of 

30 a wide range of organisms in order to enhance, or otherwise alter, isoprenoid 
synthesis and metabolism. 

For example, in one aspect the present invention provides methods of 
enhancing the level of expression of 1 -deoxy-D-xylulose-5-phosphate 
reductoisomerase in a host cell (such as a plant cell) including the step of introducing 

35 into a host cell a replicable expression vector that includes a nucleic acid molecule 
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that encodes a l-deoxy-D-xylulose-5-phosphate reductoisomerase protein under 
conditions that enable expression of the l-deoxy-D-xylulose-5-phosphate 
reductoisomerase in the host cell. By way of representative example, in addition to 
the nucleic acid molecule having the sequence set forth in SEQ ID NO:l herein, 
nucleic acid molecules encoding the l-deoxy-D-xylulose-5-phosphate 
reductoisomerase protein reported in Schwender et al., FEBS Letters 455(1-2): 140- 
144 (1999), which publication is incorporated herein by reference, are useful in this 
aspect of the invention. The Schwender et al protein is deposited in the Genbank 
database under the Genbank Accession No. CAB43344. In one embodiment of this 
aspect of the invention, nucleic acid sequences that encode 1 -deoxy-D-xylulose- 
5-phosphate reductoisomerase hybridize under stringent conditions to the antisense 
complement of the nucleic acid sequence set forth in SEQ ID NO: 1 . 

Again by way of non-limiting example, in another aspect the present 
invention provides methods of reducing the level of expression of 1-deoxy- 
D-xylulose-5-phosphate reductoisomerase in a host cell (such as a plant cell) 
including the step of introducing into a host cell a replicable expression vector that 
includes a nucleic acid molecule that hybridizes under stringent conditions to the 
nucleic acid sequence set forth in SEQ ID NO:l. Thus, for example, in addition to 
the antisense complement of the nucleic acid sequence set forth in SEQ ID NO:l 
herein, representative nucleic acid molecules useful in this aspect of the invention 
include the antisense complements of the following nucleic acid molecules 
(identified by their Genbank database accession numbers): AI781096, AW256284, 
A W065057, A W286486, AI727207, AI90 1 056. 

Although the l-deoxy-D-xylulose-5-phosphate reductoisomerase protein 
encoded by the peppermint cDNA, disclosed herein, direct the enzyme to plastids, 
substitution of the presumptive targeting sequence of this enzyme with other 
transport sequences well known in the art (See, for example, the following 
publications, the cited portions of which are incorporated by reference herein: 
vonHeijne etal., Eur. 1 Biochem., 180:535-545, 1989; Stryer, Biochemistry, 
W.H. Freeman and Company, New York, NY, p. 769 [1988]) may be employed to 
direct l-deoxy-D-xylulose-5-phosphate reductoisomerase to other cellular or 
extracellular locations. 

In addition to native, plant l-deoxy-D-xylulose-5-phosphate reductoisomerase 
amino acid sequences, sequence variants produced by deletions, substitutions, 
mutations and/or insertions and truncations are intended to be within the scope of the 
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invention except insofar as limited by the prior art. The 1-deoxy-D-xylulose- 
5-phosphate reductoisomerase amino acid sequence variants of this invention may be 
constructed by mutating the DNA sequences that encode the wild-type enzymes, such 
as by using techniques commonly referred to as site-directed mutagenesis. Nucleic 
5 acid molecules encoding the l-deoxy-D-xylulose-5-phosphate reductoisomerases of 
the present invention can be mutated by a variety of PCR techniques well known to 
one of ordinary skill in the art. (See, for example, the following publications, the 
cited portions of which are incorporated by reference herein: "PCR Strategies", MA. 
Innis, D.H. Gelfand and J.J. Sninsky, eds., 1995, Academic Press, San Diego, CA 

10 (Chapter 14);,"PCR Protocols: A Guide to Methods and Applications", MA. Innis, 
D.H. Gelfand, JJ. Sninsky and TJ. White, eds., Academic Press, NY (1 990). 

By way of non-limiting example, the two primer system utilized in the 
Transformer Site-Directed Mutagenesis kit from Clontech, may be employed for 
introducing site-directed mutants into the l-deoxy-D-xylulose-5-phosphate 

15 reductoisomerase genes of the present invention. Following denaturation of the 
target plasmid in this system, two primers are simultaneously annealed to the 
plasmid; one of these primers contains the desired site-directed mutation, the other 
contains a mutation at another point in the plasmid resulting in elimination of a 
restriction site. Second strand synthesis is then carried out, tightly linking these two 

20 mutations, and the resulting plasmids are transformed into a mutS strain of E. coli. 
Plasmid DNA is isolated from the transformed bacteria, restricted with the relevant 
restriction enzyme (thereby linearizing the unmutated plasmids), and then 
retransformed into E. coli. This system allows for generation of mutations directly in 
an expression plasmid, without the necessity of subcloning or generation of single- 

25 stranded phagemids. The tight linkage of the two mutations and the subsequent 
linearization of unmutated plasmids results in high mutation efficiency and allows 
minimal screening. Following synthesis of the initial restriction site primer, this 
method requires the use of only one new primer type per mutation site. Rather than 
prepare each positional mutant separately, a set of "designed degenerate" 

30 oligonucleotide primers can be synthesized in order to introduce all of the desired 
mutations at a given site simultaneously. Transformants can be screened by 
sequencing the plasmid DNA through the mutagenized region to identify and sort 
mutant clones. Each mutant DNA can then be fully sequenced or restricted and 
analyzed by electrophoresis on Mutation Detection Enhancement gel (J.T. Baker) to 
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confirm that no other alterations in the sequence have occurred (by band shift 
comparison to the unmutagenized control). 

Again, by way of non-limiting example, the two primer system utilized in the 
QuikChange™ Site-Directed Mutagenesis kit from Stratagene (LaJolla, California), 
may be employed for introducing site-directed mutants into the 1-deoxy-D-xylulose- 
5-phosphate reductoisomerase genes of the present invention. Double-stranded 
plasmid DNA, containing the insert bearing the target mutation site, is denatured and 
mixed with two oligonucleotides complementary to each of the strands of the 
plasmid DNA at the target mutation site. The annealed oligonucleotide primers are 
extended using Pfu DNA polymerase, thereby generating a mutated plasmid 
containing staggered nicks. After temperature cycling, the unmutated, parental DNA 
template is digested with restriction enzyme Dpnl which cleaves methylated or 
hemimethylated DNA, but which does not cleave unmethylated DNA. The parental, 
template DNA is almost always methylated or hemimethylated since most strains of 
£. coli, from which the template DNA is obtained, contain the required methylase 
activity. The remaining, annealed vector DNA incorporating the desired mutation(s) 
is transformed into E. coll. 

The mutated l-deoxy-D-xylulose-5-phosphate reductoisomerase gene can be 
cloned into a pET (or other) overexpression vector that can be employed to transform 
£ coli such as strain E. coli BL21(DE3)pLysS, for high level production of the 
mutant protein, and purification by standard protocols. Examples of plasmid vectors 
and E. coli strains that can be used to express high levels of the 1 -deoxy-D-xylulose- 
5-phosphate reductoisomerase proteins of the present invention are set forth in 
Sambrook et al, Molecular Cloning, A Laboratory Manual, 2nd Edition (1989), 
Chapter 17, incorporated herein by reference. The method of FAB-MS mapping can 
be employed to rapidly check the fidelity of mutant expression. This technique 
provides for sequencing segments throughout the whole protein and provides the 
necessary confidence in the sequence assignment. In a mapping experiment of this 
type, protein is digested with a protease (the choice will depend on the specific region 
to be modified since this segment is of prime interest and the remaining map should 
be identical to the map of unmutagenized protein). The set of cleavage fragments is 
fractionated by microbore HPLC (reversed phase or ion exchange, again depending 
on the specific region to be modified) to provide several peptides in each fraction, 
and the molecular weights of the peptides are determined by FAB-MS. The masses 
are then compared to the molecular weights of peptides expected from the digestion 
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of the predicted sequence, and the correctness of the sequence quickly ascertained. 
Since the exemplary mutagenesis techniques set forth herein produce site-directed 
mutations, sequencing of the altered peptide should not be necessary if the mass 
spectrograph agrees with prediction. If necessary to verify a changed residue, 
CAD-tandem MS/MS can be employed to sequence the peptides of the mixture in 
question, or the target peptide can be purified for subtractive Edman degradation or 
carboxypeptidase Y digestion depending on the location of the modification. 

In the design of a particular site directed mutagenesis experiment, it is 
generally desirable to first make a non-conservative substitution (e.g., Ala for Cys, 
His or Glu) and determine if activity is greatly impaired as a consequence. The 
properties of the mutagenized protein are then examined with particular attention to 
the kinetic parameters of K m and k cat as sensitive indicators of altered function, from 
which changes in binding and/or catalysis per se may be deduced by comparison to 
the native enzyme. If the residue is by this means demonstrated to be important by 
activity impairment, or knockout, then conservative substitutions can be made, such 
as Asp for Glu to alter side chain length, Ser for Cys, or Arg for His. For 
hydrophobic segments, it is largely size that is usefully altered, although aromatics 
can also be substituted for alkyl side chains. Changes in the normal product 
distribution can indicate which step(s) of the reaction sequence have been altered by 
the mutation. Modification of the hydrophobic pocket can be employed to change 
binding conformations for substrates and result in altered regiochemistry and/or 
stereochemistry. 

Other site directed mutagenesis techniques may also be employed with the 
nucleotide sequences of the invention. For example, restriction endonuclease 
digestion of DNA followed by ligation may be used to generate deletion variants of 
l-deoxy-D-xylulose-5-phosphate reductoisomerase, as described in section 15.3 of 
Sambrook et al. Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring 
Harbor Laboratory Press, New York, NY [1989], incorporated herein by reference. 
A similar strategy may be used to construct insertion variants, as described in 
section 15.3 of Sambrook et al., supra. 

Oligonucleotide-directed mutagenesis may also be employed for preparing 
substitution variants of this invention, as well as truncations. It may also be used to 
conveniently prepare the deletion and insertion variants of this invention. This 
technique is well known in the art as described by Adelmanetal. (DNA 2:183 
[1983]); Sambrook et al., supra; "Current Protocols in Molecular Biology", 1991, 
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Wiley (NY), F.T. Ausubel, R. Brent, R.E. Kingston, D.D. Moore, J.D. Seidman, J.A. 
Smith and K. Struhl, eds, incorporated herein by reference. 

Generally, oligonucleotides of at least 25 nucleotides in length are used to 
insert, delete or substitute two or more nucleotides in the 1-deoxy-D-xylulose- 
5 5-phosphate reductoisomerase molecule. An optimal oligonucleotide will have 12 
to 15 perfectly matched nucleotides on either side of the nucleotides coding for the 
mutation. To mutagenize wild-type l-deoxy-D-xylulose-5-phosphate 

reductoisomerase, the oligonucleotide is annealed to the single-stranded DNA 
template molecule under suitable hybridization conditions. A DNA polymerizing 

10 enzyme, usually the Klenow fragment of E. coli DNA polymerase I, is then added. 
This enzyme uses the oligonucleotide as a primer to complete the synthesis of the 
mutation-bearing strand of DNA. Thus, a heteroduplex molecule is formed such that 
one strand of DNA encodes the wild-type enzyme inserted in the vector, and the 
second strand of DNA encodes the mutated form of the enzyme inserted into the 

15 same vector. This heteroduplex molecule is then transformed into a suitable host 
cell. 

Mutants with more than one amino acid substituted may be generated in one 
of several ways. If the amino acids are located close together in the polypeptide 
chain, they may be mutated simultaneously using one oligonucleotide that codes for 

20 all of the desired amino acid substitutions. If, however, the amino acids are located 
some distance from each other (separated by more than ten amino acids, for example) 
it is more difficult to generate a single oligonucleotide that encodes all of the desired 
changes. Instead, one of two alternative methods may be employed. In the first 
method, a separate oligonucleotide is generated for each amino acid to be substituted. 

25 The oligonucleotides are then annealed to the single-stranded template DNA 
simultaneously, and the second strand of DNA that is synthesized from the template 
will encode all of the desired amino acid substitutions. An alternative method 
involves two or more rounds of mutagenesis to produce the desired mutant. The first 
round is as described for the single mutants: wild-type 1-deoxy-D-xylulose- 

30 5-phosphate reductoisomerase DNA is used for the template, an oligonucleotide 
encoding the first desired amino acid substitution(s) is annealed to this template, and 
the heteroduplex DNA molecule is then generated. The second round of mutagenesis 
utilizes the mutated DNA produced in the first round of mutagenesis as the template. 
Thus, this template already contains one or more mutations. The oligonucleotide 

35 encoding the additional desired amino acid substitution(s) is then annealed to this 
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template, and the resulting strand of DNA now encodes mutations from both the first 
and second rounds of mutagenesis. This resultant DNA can be used as a template in 
a third round of mutagenesis, and so on. 

A gene (or other nucleic acid molecule) encoding 1-deoxy-D-xylulose- 
5 5-phosphate reductoisomerase may be incorporated into any organism (intact plant, 
animal, microbe, etc.), or cell culture derived therefrom. The enzyme 
l-deoxy-D-xylulose-5-phosphate reductoisomerase catalyzes the first committed step 
in the conversion of 1 -deoxy-D-xylulose-5-phosphate to isopentenyl diphosphate 
which, in turn, is converted to a variety of molecules including, for example, 

10 carotenoids, and the prenyl side chains of chlorophyll, plastoquinone and 
tocopherols. Thus, a l-deoxy-D-xylulose-5-phosphate reductoisomerase gene (or 
other nucleic acid molecule) may be introduced into any organism for a variety of 
purposes including, but not limited to: production of 1-deoxy-D- 
xylulose-5-phosphate reductoisomerase, or its product 2-C-methyl-D-erythritol- 

15 4-phosphate; enhancement of chlorophyll production by increasing the synthesis of 
the phytol side-chain; enhancement of production of terpenoids, phytoalexins, toxins, 
and deterrent compounds to improve defense against pathogens, insects and other 
herbivores; enhance the production of monoterpene flavor and aroma compounds in 
essential oil plants, fruits and vegetables to improve the flavor and aroma profiles, or 

20 improve the yield of flavor and aroma compounds extracted from plants; to prepare 
synthetic intermediates in plants and microbes for industrial uses, such as the 
synthesis of adhesives, inks and polymers; to enhance the production of natural 
pigments, such as carotenoids, in plants, and to improve the yield of natural pigments 
extracted from plants for medicinal or culinary uses; to enhance the yield in plants of 

25 compounds having anti-cancer or other nutraceutical properties, such as vitamin A 
and vitamin E; and to produce 2C-methyl-D-erythritol phosphate as an enzymatic or 
chemical intermediate. While the nucleic acid molecules of the present invention can 
be introduced into any organism, the nucleic acid molecules of the present invention 
will preferably be introduced into a plant species. 

30 Eukaryotic expression systems may be utilized for the production of 1-deoxy- 

D-xylulose-5-phosphate reductoisomerase since they are capable of carrying out any 
required posttranslational modifications and of directing the enzyme to the proper 
cellular compartment. A representative eukaryotic expression system for this 
purpose uses the recombinant baculovirus, Autographa calif ornica nuclear 

35 polyhedrosis virus (AcNPV; M.D. Summers and G.E. Smith, A Manual of Methods 
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for Baculovirus Vectors and Insect Cell Culture Procedures [1986]; Luckow.et al., 
Bio-technology, 6:47-55 [1987]) for expression of the 1 -deoxy-D-xylulose- 
5-phosphate reductoisomerases of the invention. Infection of insect cells (such as 
cells of the species Spodoptera frugiperda) with the recombinant baculoviruses 
5 allows for the production of large amounts of the l-deoxy-D-xylulose-5-phosphate 
reductoisomerase proteins. In addition, the baculovirus system has other important 
advantages for the production of recombinant l-deoxy-D-xylulose-5-phosphate 
reductoisomerase. For example, baculoviruses do not infect humans and can 
therefore be safely handled in large quantities. In the baculovirus system, a DNA 

10 construct is prepared including a DNA segment encoding 
l-deoxy-D-xyIulose-5-phosphate reductoisomerase and a vector. The vector may 
comprise the polyhedron gene promoter region of a baculovirus, the baculovirus 
flanking sequences necessary for proper cross-over during recombination (the 
flanking sequences comprise about 200-300 base pairs adjacent to the promoter 

15 sequence) and a bacterial origin of replication which permits the construct to 
replicate in bacteria. The vector is constructed so that (i) the DNA segment is placed 
adjacent (or operably linked or "downstream" or "under the control of) to the 
polyhedron gene promoter and (ii)the promoter/1 -deoxy-D-xylulose-5-phosphate 
reductoisomerase combination is flanked on both sides by 200-300 base pairs of 

20 baculovirus DNA (the flanking sequences). 

To produce the l-deoxy-D-xylulose-5-phosphate reductoisomerase DNA 
construct, a cDNA clone encoding the full length l-deoxy-D-xylulose-5-phosphate 
reductoisomerase is obtained using methods such as those described herein. The 
DNA construct is contacted in a host cell with baculovirus DNA of an appropriate 

25 baculovirus (that is, of the same species of baculovirus as the promoter encoded in 
the construct) under conditions such that recombination is effected. The resulting 
recombinant baculoviruses encode the full l-deoxy-D-xylulose-5-phosphate 
reductoisomerase. For example, an insect host cell can be cotransfected or 
transfected separately with the DNA construct and a functional baculovirus. 

30 Resulting recombinant baculoviruses can then be isolated and used to infect cells to 
effect production of the l-deoxy-D-xylulose-5-phosphate reductoisomerase. Host 
insect cells include, for example, Spodoptera frugiperda cells, that are capable of 
producing a baculovirus-expressed 1 -deoxy-D-xylulose-5-phosphate 
reductoisomerase. Insect host cells infected with a recombinant baculovirus of the 

35 present invention are then cultured under conditions allowing expression of the 
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baculovirus-encoded 1 -deoxy-D-xylulose-5-phosphate reductoisomerase. 
l-deoxy-D-xylulose-5-phosphate reductoisomerase thus produced is then extracted 
from the cells using methods known in the art. 

Other eukaryotic microbes such as yeasts may also be used to practice this 
5 invention. The baker's yeast Saccharomyces cerevisiae, is a commonly used yeast, 
although several other strains are available. The plasmid YRp7 (Stinchcomb et al., 
Nature, 282:39 [1979]; Kingsman et al., Gene 7:141 [1979]; Tschemper et al., Gene, 
10:157 [1980]) is commonly used as an expression vector in Saccharomyces, This 
plasmid contains the trpl gene that provides a selection marker for a mutant strain of 

10 yeast lacking the ability to grow in tryptophan, such as strains ATCC No. 44,076 and 
PEP4-1 (Jones, Genetics, 85:12 [1977]). The presence of the trpl lesion as a 
characteristic of the yeast host cell genome then provides an effective environment 
for detecting transformation by growth in the absence of tryptophan. Yeast host cells 
are generally transformed using the polyethylene glycol method, as described by 

15 Hinnen (Proc. Natl. Acad Set USA, 75:1929 [1978]). Additional yeast 
transformation protocols are set forth in Gietz etal., N.A.R., 20(17): 1425(1992); 
Reeves etal., FEMS, 99(2-3): 193-197, (1992), both of which publications are 
incorporated herein by reference. 

Suitable promoting sequences in yeast vectors include the promoters for 

20 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. y 255:2073 [1980]) or 
other glycolytic enzymes (Hess etal., J. Adv. Enzyme Reg. 7:149 [1968]; 
Holland et al., Biochemistry, 17:4900 [1978]), such as enolase, gIyceraldehyde-3- 
phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, 
glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, 

25 triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. In the 
construction of suitable expression plasmids, the termination sequences associated 
with these genes are also ligated into the expression vector 3' of the sequence desired 
to be expressed to provide polyadenylation of the mRNA and termination. Other 
promoters that have the additional advantage of transcription controlled by growth 

30 conditions are the promoter region for alcohol dehydrogenase 2, isocytochrome C, 
acid phosphatase, degradative enzymes associated with nitrogen metabolism, and the 
aforementioned glyceraldehyde-3-phosphate dehydrogenase, and enzymes 
responsible for maltose and galactose utilization. Any plasmid vector containing 
yeast-compatible promoter, origin of replication and termination sequences is 

35 suitable. 
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Cell cultures derived from multicellular organisms, such as plants, may be 
used as hosts to practice this invention. Transgenic plants can be obtained, for 
example, by transferring plasmids that encode l-deoxy-D-xylulose-5-phosphate 
reductoisomerase and a selectable marker gene, e.g., the kan gene encoding 

5 resistance to kanamycin, into Agrobacterium tumifaciens containing a helper Ti 
plasmid as described in Hoeckema et al., Nature, 303:179-181 [1983] and culturing 
the Agrobacterium cells with leaf slices, or other tissues or cells, of the plant to be 
transformed as described by Anetal., Plant Physiology, 81:301-305 [1986]. 
Transformation of cultured plant host cells is normally accomplished through 

0 Agrobacterium tumifaciens. Cultures of mammalian host cells and other host cells 
that do not have rigid cell membrane barriers are usually transformed using the 
calcium phosphate method as originally described by Graham and VanderEb 
{Virology, 52:546 [1978]) and modified as described in sections 16.32-16.37 of 
Sambrooketal., supra. However, other methods for introducing DNA into cells 
such as Polybrene (Kawai and Nishizawa, Mol Cell Biol, 4:1172 [1984]), 
protoplast fusion (Schaffher, Proc, Natl Acad. Scl USA, 77:2163 [1980]), 
electroporation (Neumann et al., EMBOJ., 1:841 [1982]), and direct microinjection 
into nuclei (Capecchi, Cell, 22:479 [1980]) may also be used. Additionally, animal 
transformation strategies are reviewed in Monastersky G.M. and Robl, J.M.. 
Strategies in Transgenic Animal Science, ASM Press, Washington, D.C., 1995, 
incorporated herein by reference. Transformed plant calli may be selected through 
the selectable marker by growing the cells on a medium containing, e.g., kanamycin, 
and appropriate amounts of phytohormone such as naphthalene acetic acid and 
benzyladenine for callus and shoot induction. The plant cells may then be 
regenerated and the resulting plants transferred to soil using techniques well known 
to those skilled in the art. 

In addition, a gene regulating l-deoxy-D-xylulose-5-phosphate 
reductoisomerase production can be incorporated into the plant along with a 
necessary promoter which is inducible. In the practice of this embodiment of the 
invention, a promoter that only responds to a specific external or internal stimulus is 
fused to the target cDNA. Thus, the gene will not be transcribed except in response 
to the specific stimulus. As long as the gene is not being transcribed, its gene product 
is not produced. 

An illustrative example of a responsive promoter system that can be used in 
the practice of this invention is the glutathione-S-transferase (GST) system in maize. 
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GSTs are a family of enzymes that can detoxify a number of hydrophobic 
electrophilic compounds that often are used as pre-emergent herbicides 
(Weigand et al., Plant Molecular Biology, 7:235-243 [1986]). Studies have shown 
that the GSTs are directly involved in causing this enhanced herbicide tolerance. 
5 This action is primarily mediated through a specific 1.1 kb mRNA transcription 
product. In short, maize has a naturally occurring quiescent gene already present that 
can respond to external stimuli and that can be induced to produce a gene product. 
This gene has previously been identified and cloned. Thus, in one embodiment of 
this invention, the promoter is removed from the GST responsive gene and attached 

10 to a l-deoxy-D-xylulose-5-phosphate reductoisomerase gene that previously has had 
its native promoter removed. This engineered gene is the combination of a promoter 
that responds to an external chemical stimulus and a gene responsible for successful 
production of l-deoxy-D-xylulose-5-phosphate reductoisomerase. 

In addition to the methods described above, several methods are known in the 

15 art for transferring cloned DNA into a wide variety of plant species^ including 
gymnosperms, angiosperms, monocots and dicots (see, e.g., Glick and 
Thompson, eds., Methods in Plant Molecular Biology, CRC Press, Boca Raton, 
Florida [1993], incorporated by reference herein). Representative examples include 
electroporation-facilitated DNA uptake by protoplasts in which an electrical pulse 

20 transiently permeabilizes cell membranes, permitting the uptake of a variety of 
• biological molecules, including recombinant DNA (Rhodes et al., Science, 
240(4849):204-207 [1988]); treatment of protoplasts with polyethylene glycol 
(Lyznik et al., Plant Molecular Biology, 13:151-161 [1989]); and bombardment of 
cells with DNA-laden microprojectiles which are propelled by explosive force or 

25 compressed gas to penetrate the cell wall (Klein etal., Plant Physiol 91:440-444 
[1989] and BoyntonetaL, Science, 240(4858):1 534-1538 [1988]). Transformation 
of woody species can be achieved, for example, by employing the methods set forth 
in Han et al, Plant Science, 95:187-196 (1994), incorporated herein by reference. A 
method that has been applied to Rye plants (Secale cereale) is to directly inject 

30 plasmid DNA, including a selectable marker gene, into developing floral tillers (de la 
Pena et al., Nature 325:274-276 (1987)). Further, plant viruses can be used as 
vectors to transfer genes to plant cells. Examples of plant viruses that can be used as 
vectors to transform plants include the Cauliflower Mosaic Virus (Brisson et al., 
Nature 310: 511-514 (1984); Additionally, plant transformation strategies and 

35 techniques are reviewed in Birch, R.G., Ann Rev Plant Phys Plant Mol Biol, 48:297 
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(1997); Forester etal., Exp. Agric, 33:15-33 (1997). Numerous publications 
describe transformation techniques that have been successfully applied to mint 
(Mentha) species. Representative publications disclosing mint transformation 
techniques are: A. Spencer et al., Phytochemistry 32: 911-919 (1993); C. Berry 
5 et al., Plant Cell Tissue Organ Cult. 44: 177-181 (1996); J.C. Caissard et al., Plant 
Cell Rep. 16: 67-70 (1996); X. Niu et al., Plant Cell Rep. 17: 165-171 (1998); F. 
Diemer et al., Plant ScL 138: 101-108 (1998). The aforementioned publications 
disclosing plant transformation techniques are incorporated herein by reference, and 
minor variations make these technologies applicable to a broad range of plant 
10 species. 

Each of these techniques has advantages and disadvantages. In each of the 
techniques, DNA from a plasmid is genetically engineered such that it contains not 
only the gene of interest, but also selectable and screenable marker genes. A 
selectable marker gene is used to select only those cells that have integrated copies of 

15 the plasmid (the construction is such that the gene of interest and the selectable and 
screenable genes are transferred as a unit). The screenable gene provides another 
check for the successful culturing of only those cells carrying the genes of interest. A 
commonly used selectable marker gene is neomycin phosphotransferase II (NPT II), 
This gene conveys resistance to kanamycin, a compound that can be added directly to 

20 the growth media on which the cells grow. Plant cells are normally susceptible to 
kanamycin and, as a result, die. The presence of the NPT II gene overcomes the 
effects of the kanamycin and each cell with this gene remains viable. Another 
selectable marker gene which can be employed in the practice of this invention is the 
gene which confers resistance to the herbicide glufosinate (Basta). A screenable gene 

25 commonly used is the p -glucuronidase gene (GUS). The presence of this gene is 
characterized using a histochemical reaction in which a sample of putatively 
transformed cells is treated with a GUS assay solution. After an appropriate 
incubation, the cells containing the GUS gene turn blue. 

The plasmid containing one or more of these genes is introduced into either 

30 plant protoplasts or callus cells by any of the previously mentioned techniques. If the 
marker gene is a selectable gene, only those cells that have incorporated the DNA 
package survive under selection with the appropriate phytotoxic agent. Once the 
appropriate cells are identified and propagated, plants are regenerated. Progeny from 
the transformed plants must be tested to insure that the DNA package has been 

35 successfully integrated into the plant genome. 
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Mammalian host cells may also be used in the practice of the invention. 
Examples of suitable mammalian cell lines include monkey kidney CVI line 
transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic kidney 
line293S (Graham et al., J. Gen. Virol, 36:59 [1977]); baby hamster kidney cells 
5 (BHK, ATCC CCL 10); Chinese hamster ovary cells (Urlab and Chasin, Proc. Natl 
Acad. Sci USA 77:4216 [1980]); mouse Sertoli cells (TM4, Mather, Biol Reprod, 
23:243 [1980]); monkey kidney cells (CVI-76, ATCC CCL 70); African green 
monkey kidney cells (VERO-76, ATCC CRL- 1587); human cervical carcinoma cells 
(HELA, ATCC CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat 

10 liver cells (BRL 3A, ATCC CRL 1442); human lung cells (W138, ATCC CCL 75); 
human liver cells (Hep G2, HB 8065); mouse mammary tumor cells (MMT 060562, 
ATCC CCL 51); rat hepatoma cells (HTC, MI.54, Baumann et al., 1 Cell Biol, 85:1 
[1980]); and TRI cells (Mather et al., Annals N.Y. Acad. Sci., 383:44 [1982]). 
Expression vectors for these cells ordinarily include (if necessary) DNA sequences 

15 for an origin of replication, a promoter located in front of the gene to be expressed, a 
ribosome binding site, an RNA splice site, a polyadenylation site, and a transcription 
terminator site. 

Promoters used in mammalian expression vectors are often of viral origin. 
These viral promoters are commonly derived from polyoma virus, Adenovirus 2, and 

20 most frequently Simian Virus 40 (SV40). The SV40 virus contains two promoters 
that are termed the early and late promoters. These promoters are particularly useful 
because they are both easily obtained from the virus as one DNA fragment that also 
contains the viral origin of replication (Fiers et al., Nature, 273:1 13 [1978]). Smaller 
or larger SV40 DNA fragments may also be used, provided they contain the 

25 approximately 250-bp sequence extending from the Hindlll site toward the Bgll site 
located in the viral origin of replication. 

Alternatively, promoters that are naturally associated with the foreign gene 
(homologous promoters) may be used provided that they are compatible with the host 
cell line selected for transformation. 

30 An origin of replication may be obtained from an exogenous source, such as 

SV40 or other virus (e.g., Polyoma, Adeno, VSV, BPV) and inserted into the cloning 
vector. Alternatively, the origin of replication may be provided by the host cell 
chromosomal replication mechanism. If the vector containing the foreign gene is 
integrated into the host cell chromosome, the latter is often sufficient. 
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The use of a secondary DNA coding sequence can enhance production levels 
of l-deoxy-D-xyIulose-5-phosphate reductoisomerase in transformed cell lines. The 
secondary coding sequence typically comprises the enzyme dihydrofolate reductase 
(DHFR). The wild-type form of DHFR is normally inhibited by the chemical 
5 methotrexate (MTX). The level of DHFR expression in a cell will vary depending on 
the amount of MTX added to the cultured host cells. An additional feature of DHFR 
that makes it particularly useful as a secondary sequence is that it can be used as a 
selection marker to identify transformed cells. Two forms of DHFR are available for 
use as secondary sequences, wild-type DHFR and MTX-resistant DHFR. The type of 

10 DHFR used in a particular host cell depends on whether the host cell is DHFR 
deficient (such that it either produces very low levels of DHFR endogenously, or it 
does not produce functional DHFR at all). DHFR-deficient cell lines such as the 
CHO cell line described by Urlaub and Chasin, supra, are transformed with wild-type 
DHFR coding sequences. After transformation, these DHFR-deficient cell lines 

15 express functional DHFR and are capable of growing in a culture medium lacking the 
nutrients hypoxanthine, glycine and thymidine. Nontransformed cells will not 
survive in this medium. 

The MTX-resistant form of DHFR can be used as a means of selecting for 
transformed host cells in those host cells that endogenously produce normal amounts 

20 of functional DHFR that is MTX sensitive. The CHO-K1 cell line (ATCC 
No. CL61) possesses these characteristics, and is thus a useful cell line for this 
purpose. The addition of MTX to the cell culture medium will permit only those 
cells transformed with the DNA encoding the MTX-resistant DHFR to grow. The 
nontransformed cells will be unable to survive in this medium. 

25 Prokaryotes may also be used as host cells for the initial cloning steps of this 

invention, or for expressing the proteins of the present invention. They are 
particularly useful for rapid production of large amounts of DNA, for production of 
single-stranded DNA templates used for site-directed mutagenesis, for screening 
many mutants simultaneously, and for DNA sequencing of the mutants generated. 

30 Suitable prokaryotic host cells include E. coli K12 strain 94 (ATCC No. 31,446), 
K coli strain W31 10 (ATCC No. 27,325) E coli X1776 (ATCC No. 31,537), and 
E. coli B; however many other strains of E. coli y such as HB101, JM101, NM522, 
NM538, NM539, and many other species and genera of prokaryotes including bacilli 
such as Bacillus subtilis, other enterobacteriaceae such as Salmonella typhimurium or 

35 Serratia marcesans, and various Pseudomonas species may all be used as hosts. 
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Prokaryotic host cells or other host cells with rigid cell walls are preferably 
transformed using the calcium chloride method as described in section 1.82 of 
Sambrook et aL, supra. Alternatively, electroporation may be used for 
transformation of these cells. Prokaryote transformation techniques are set forth in 
Dower, W.J., in Genetic Engineering, Principles and Methods, 12:275-296, Plenum 
Publishing Corp., 1990; Hanahan et aL, Meth EnzymoL, 204:63 (1991). 

As a representative example, cDNA sequences encoding I-deoxy- 
D-xylulose-5-phosphate reductoisomerase may be transferred to the (His) 6 *Tag pET 
vector commercially available (from Novagen) for overexpression in E. coli as 
heterologous host. This pET expression plasmid has several advantages in high level 
heterologous expression systems. The desired cDNA insert is ligated in frame to 
plasmid vector sequences encoding six histidines followed by a highly specific 
protease recognition site (thrombin) that are joined to the amino terminus codon of 
the target protein. The histidine "block" of the expressed fusion protein promotes 
very tight binding to immobilized metal ions and permits rapid purification of the 
recombinant protein by immobilized metal ion affinity chromatography. The 
histidine leader sequence is then cleaved at the specific proteolysis site by treatment 
of the purified protein with thrombin, and the l-deoxy-D-xylulose-5-phosphate 
reductoisomerase again purified by immobilized metal ion affinity chromatography, 
this time using a shallower imidazole gradient to elute the recombinant 
reductoisomerase while leaving the histidine block still adsorbed. This 
overexpression-purification system has high capacity, excellent resolving power and 
is fast, and the chance of a contaminating E. coli protein exhibiting similar binding 
behavior (before and after thrombin proteolysis) is extremely small. 

As will be apparent to those skilled in the art, any plasmid vectors containing 
replicon and control sequences that are derived from species compatible with the host 
cell may also be used in the practice of the invention. The vector usually has a 
replication site, marker genes that provide phenotypic selection in transformed cells, 
one or more promoters, and a polylinker region containing several restriction sites for 
insertion of foreign DNA. Plasmids typically used for transformation of E. coli 
include pBR322, pUC18, pUC19, pUCI18, pUC119, and Bluescript M13, all of 
which are described in sections 1 .12-1.20 of Sambrook et aL, supra. However, many 
other suitable vectors are available as well. These vectors contain genes coding for 
ampicillin and/or tetracycline resistance which enables cells transformed with these 
vectors to grow in the presence of these antibiotics. 
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The promoters most commonly used in prokaryotic vectors include the 
P-lactamase (penicillinase) and lactose promoter systems (Chang etal. Nature, 
375:615 [1978]; Itakura et al., Science, 198:1056 [1977]; Goeddel et al., Nature, 
281:544 [1979]) and a tryptophan (trp) promoter system (Goeddel et al., Nucl Acids 
5 Res., 8:4057 [1980]; EPO Appl. Publ. No. 36,776), and the alkaline phosphatase 
systems. While these are the most commonly used, other microbial promoters have 
been utilized, and details concerning their nucleotide sequences have been published, 
enabling a skilled worker to ligate them functionally into plasmid vectors (see 
Siebenlist et al., Cell, 20:269 [1980]). 

10 Many eukaryotic proteins normally secreted from the cell contain an 

endogenous secretion signal sequence as part of the amino acid sequence. Thus, 
proteins normally found in the cytoplasm can be targeted for secretion by linking a 
signal sequence to the protein. This is readily accomplished by ligating DNA 
encoding a signal sequence to the 5' end of the DNA encoding the protein and then 

15 expressing this fusion protein in an appropriate host cell. The DNA encoding the 
signal sequence may be obtained as a restriction fragment from any gene encoding a 
protein with a signal sequence. Thus, prokaryotic, yeast, and eukaryotic signal 
sequences may be used herein, depending on the type of host cell utilized to practice 
the invention. The DNA and amino acid sequence encoding the signal sequence 

20 portion of several eukaryotic genes including, for example, human growth hormone, 
proinsulin, and proalbumin are known (see Stiver, Biochemistry W.H. Freeman and 
Company, New York, NY, p. 769 [1988]), and can be used as signal sequences in 
appropriate eukaryotic host cells. Yeast signal sequences, as for example acid 
phosphatase (Arimaetal., Nuc. Acids Res., 11:1657 [1983]), a-factor, alkaline 

25 phosphatase and invertase may be used to direct secretion from yeast host cells. 
Prokaryotic signal sequences from genes encoding, for example, LamB or OmpF 
(Wong et al., Gene, 68:193 [1988]), MalE, PhoA, or beta-lactamase, as well as other 
genes, may be used to target proteins from prokaryotic cells into the culture medium. 
Trafficking sequences from plants, animals and microbes can be employed in 

30 the practice of the invention to direct the 1 -deoxy-D-xylulose-5 -phosphate 
reductoisomerase proteins of the present invention to the cytoplasm, endoplasmic 
reticulum, mitochondria or other cellular components, or to target the protein for 
export to the medium. These considerations apply to the overexpression of 
l-deoxy-D-xylulose-5-phosphate reductoisomerase, and to direction of expression 
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within cells or intact organisms to permit gene product function in any desired 
location. 

The construction of suitable vectors containing DNA encoding replication 
sequences, regulatory sequences, phenotypic selection genes and the 
l-deoxy-D-xyluIose-5-phosphate reductoisomerase DNA of interest are prepared 
using standard recombinant DNA procedures. Isolated plasmids and DNA fragments 
are cleaved, tailored, and ligated together in a specific order to generate the desired 
vectors, as is well known in the art (see, for example, Sambrook et al., supra). 

The l-deoxy-D-xyIulose-5-phosphate reductoisomerase proteins of the 
present invention can be isolated, for example, by incorporating a nucleic acid 
molecule of the invention (such as a cDNA molecule) into an expression vector, 
introducing the expression vector into a host cell and expressing the nucleic acid 
molecule to yield protein. Representative examples of host cells and expression 
vectors are as set forth herein. The protein can then be purified by art-recognized 
means. When a crude protein extract is initially prepared, it may be desirable to 
include one or more proteinase inhibitors in the extract. Representative examples of 
proteinase inhibitors include: serine proteinase inhibitors (such as 
phenylmethylsulfonyl fluoride (PMSF), benzamide, benzamidine HCI, 
e-Amino-rt-caproic acid and aprotinin (Trasylol)); cysteine proteinase inhibitors, such 
as sodium />-hydroxymercuribenzoate; competitive proteinase inhibitors, such as 
antipain and leupeptin; covalent proteinase inhibitors, such as iodoacetate and 
A^thylmaleimide; aspartate (acidic) proteinase inhibitors, such as pepstatin and 
diazoacetylnorleucine methyl ester (DAN); metalloproteinase inhibitors, such as 
EGTA [ethylene glycol bis(P-aminoethyl ether) AWA^'-tetraacetic acid], and the 
chelator 1, 10-phenanthroline. 

Representative examples of art-recognized techniques for purifying, or 
partially purifying, proteins from biological material are exclusion chromatography, 
ion-exchange chromatography, hydrophobic interaction chromatography, 
reversed-phase chromatography and immobilized metal affinity chromatography. 

Hydrophobic interaction chromatography and reversed-phase chromatography 
are two separation methods based on the interactions between the hydrophobic 
moieties of a sample and an insoluble, immobilized hydrophobic group present on 
the chromatography matrix. In hydrophobic interaction chromatography the matrix is 
hydrophilic and is substituted with short-chain phenyl or octyl nonpolar groups. The 
mobile phase is usually an aqueous salt solution. In reversed phase chromatography 
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the matrix is silica that has been substituted with longer H-alkyI chains, usually C 8 
(octylsilyl) or Ci 8 (octadecylsilyl). The matrix is less polar than the mobile phase. 
The mobile phase is usually a mixture of water and a less polar organic modifier. 

Separations on hydrophobic interaction chromatography matrices are usually 
5 done in aqueous salt solutions, which generally are nondenaturing conditions. 
Samples are loaded onto the matrix in a high-salt buffer and elution is by a 
descending salt gradient. Separations on reversed-phase media are usually done in 
mixtures of aqueous and organic solvents, which are often denaturing conditions. In 
the case of protein and/or peptide purification, hydrophobic interaction 
10 chromatography depends on surface hydrophobic groups and is carried out under 
conditions which maintain the integrity of the protein molecule. Reversed-phase 
chromatography depends on the native hydrophobicity of the protein and is carried 
out under conditions which expose nearly all hydrophobic groups to the matrix, Le., 
denaturing conditions. 

15 Ion-exchange chromatography is designed specifically for the separation of 

ionic or ionizable compounds. The stationary phase (column matrix material) carries 
ionizable functional groups, fixed by chemical bonding to the stationary phase. 
These fixed charges carry a counterion of opposite sign. This counterion is not fixed 
and can be displaced. Ion-exchange chromatography is named on the basis of the 

20 sign of the displaceable charges. Thus, in anion ion-exchange chromatography the 
fixed charges are positive and in cation ion-exchange chromatography the fixed 
charges are negative. 

Retention of a molecule on an ion-exchange chromatography column 
involves an electrostatic interaction between the fixed charges and those of the 

25 molecule, binding involves replacement of the nonfixed ions by the molecule. 
Elution, in turn, involves displacement of the molecule from the fixed charges by a 
new counterion with a greater affinity for the fixed charges than the molecule, and 
which then becomes the new, nonfixed ion. 

The ability of counterions (salts) to displace molecules bound to fixed charges 

30 is a function of the difference in affinities between the fixed charges and the nonfixed 
charges of both the molecule and the salt. Affinities in turn are affected by several 
variables, including the magnitude of the net charge of the molecule and the 
concentration and type of salt used for displacement. 

Solid-phase packings used in ion-exchange chromatography include cellulose, 

35 dextrans, agarose, and polystyrene. The exchange groups used include DEAE 
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(diethylaminoethyl), a weak base, that will have a net positive charge when ionized 
and will therefore bind and exchange anions; and CM (carboxymethyl), a weak acid, 
with a negative charge when ionized that will bind and exchange cations. Another 
form of weak anion exchanger contains the PEI (polyethyleneimine) functional 
5 group. This material, most usually found on thin layer sheets, is useful for binding 
proteins at pH values above their pi. The polystyrene matrix can be obtained with 
quaternary ammonium functional groups for strong base anion exchange or with 
sulfonic acid functional groups for strong acid cation exchange. Intermediate and 
weak ion-exchange materials are also available. Ion-exchange chromatography need 

10 not be performed using a column, and can be performed as batch ion-exchange 
chromatography with the slurry of the stationary phase in a vessel such as a beaker. 

Gel filtration is performed using porous beads as the chromatographic 
support. A column constructed from such beads will have two measurable liquid 
volumes, the external volume, consisting of the liquid between the beads, and the 

15 internal volume, consisting of the liquid within the pores of the beads. Large 
molecules will equilibrate only with the external volume while small molecules will 
equilibrate with both the external and internal volumes. A mixture of molecules 
(such as proteins) is applied in a discrete volume or zone at the top of a gel filtration 
column and allowed to percolate through the column. The large molecules are 

20 excluded from the internal volume and therefore emerge first from the column while 
the smaller molecules, which can access the internal volume, emerge later. The 
volume of a conventional matrix used for protein purification is typically 30 to 100 
times the volume of the sample to be fractionated. The absorbance of the column 
effluent can be continuously monitored at a desired wavelength using a flow monitor. 

25 A technique that is often applied to the purification of proteins is High 

Performance Liquid Chromatography (HPLC). HPLC is an advancement in both the 
operational theory and fabrication of traditional chromatographic systems. HPLC 
systems for the separation of biological macromolecules vary from the traditional 
column chromatographic systems in three ways; (1) the column packing materials are 

30 of much greater mechanical strength, (2) the particle size of the column packing 
materials has been decreased 5- to 10-fold to enhance adsorption-desorption kinetics 
and diminish bandspreading, and (3) the columns are operated at 10-60 times higher 
mobile-phase velocity. Thus, by way of non-limiting example, HPLC can utilize 
exclusion chromatography, ion-exchange chromatography, hydrophobic interaction 

35 chromatography, reversed-phase chromatography and immobilized metal affinity 
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chromatography. Art-recognized techniques for the purification of proteins and 
peptides are set forth in Methods in Enzymology, Vol. 182, Guide to Protein 
Purification, Murray P. Deutscher, ed (1990), which publication is incorporated 
herein by reference. 

5 In another aspect, the present invention is directed to methods of reducing the 

level of expression of l-deoxy-D-xyIulose-5-phosphate reductoisomerase protein in a 
host cell, such as a plant cell. A number of methods can be used to inhibit gene 
expression in plants. For instance, antisense RNA technology can be conveniently 
used. The successful implementation of anti-sense RNA in developmental systems 

10 to inhibit gene expression has previously been demonstrated (Van der Krol et al., 
1990 Plant Mol Biol 14:457; Visser et al., 1991, Mol Gen. Genet. 225:289; 
Hamilton et al., 1990, Nature 346:284; Stockhaus et al., 1990, EMBO J. 9:3013; 
Hudson et al., 1992, Plant Physiol 98:294; U.S. Patent Nos.: 4,801,340, 5,773,692, 
5,723,761, and 5,959,180). For example, polygalacturonase has been implicated in 

15 the process of fruit softening during the latter stages of ripening in tomato (Hiatt et 
al., 1989 in Genetic Engineering, Sedow, ed. p. 49; Sheehy et al., 1988, Proc. Natl 
Acad. Set USA 85:8805; Smith et al., 1988, Nature 334:724). The integration of 
anti-sense constructs into the tomato genome, under the control of the CaMV 35S 
promoter, has resulted in a 90% suppression of gene expression. 

20 The anti-sense gene is a DNA sequence that is inverted relative to its normal 

orientation for transcription and so expresses an RNA transcript that is 
complementary to a target mRNA molecule expressed within the host cell (i.e., the 
RNA transcript of the anti-sense gene can hybridize to the target mRNA molecule 
through Watson-Crick base pairing). An anti-sense gene may be constructed in a 

25 number of different ways provided that it is capable of interfering with the expression 
of a target gene, such as a l-deoxy-D-xylulose-5-phosphate reductoisomerase gene. 
The anti-sense gene can be constructed by inverting the coding region (or a portion 
thereof) of the target gene relative to its normal orientation for transcription to allow 
the transcription of its complement, hence the RNAs encoded by the anti-sense and 

30 sense gene are complementary. 

The anti-sense gene generally will be substantially identical to at least a 
portion of the target gene or genes. The sequence, however, need not be perfectly 
identical to inhibit expression. Generally, higher homology can be used to 
compensate for the use of a shorter anti-sense gene. The anti-sense gene generally 

35 will be substantially identical (although in antisense orientation) to the target gene. 
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The minimal identity will typically be greater than about 65%, but a higher identity 
might exert a more effective repression of expression of the endogenous sequences. 
Substantially greater identity of more than about 80% is preferred, though about 95% 
to absolute identity would be most preferred. 

Furthermore, the anti-sense gene need not have the same intron or exon 
pattern as the target gene, and non-coding segments of the target gene may be equally 
effective in achieving anti-sense suppression of target gene expression as coding 
segments. Normally, a DNA sequence of at least about 30 or 40 nucleotides should 
be used as the anti-sense gene, although a longer sequence is preferable. The 
construct is then transformed into one or more plant cells (from which whole plants 
can be regenerated as described herein) and the antisense strand of RNA is produced. 

Catalytic RNA molecules or ribozymes can also be used to inhibit expression 
of target genes. It is possible to design ribozyme transgenes that encode RNA 
ribozymes that specifically pair with a target RNA and cleave the phosphodiester 
backbone at a specific location, thereby functionally inactivating the target RNA. In 
carrying out this cleavage, the ribozyme is not itself altered, and is thus capable of 
recycling and cleaving other molecules. The inclusion of ribozyme sequences within 
antisense RNAs confers RNA-cleaving activity upon them, thereby increasing the 
activity of the antisense constructs. 

One class of ribozymes is derived from a number of small circular RNAs 
which are capable of self-cleavage and replication in plants. The RNAs replicate 
either alone (viroid RNAs) or with a helper virus (satellite RNAs). Examples include 
RNAs from avocado sunblotch viroid and the satellite RNAs from tobacco ringspot 
virus, lucerne transient streak virus, velvet tobacco mottle virus, solanurn nodiflorum 
mottle virus and subterranean clover mottle virus. The design and use of target 
RNA-specific ribozymes is described in Haseloff et al. (1988 Nature, 334:585- 
591)(see also U.S. Patent No.: 5,646,023), both of which publications are 
incorporated herein by reference. Tabler etal. (1991, Gene 108:175) have greatly 
simplified the construction of catalytic RNAs by combining the advantages of the 
anti-sense RNA and the ribozyme technologies in a single construct. Smaller regions 
of homology are required for ribozyme catalysis, therefore this can promote the 
repression of different members of a large gene family if the cleavage sites are 
conserved. 

Another method of suppressing target gene expression is sense suppression. 
Introduction of a nucleic acid molecule configured in the sense orientation has been 
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recently shown to be an effective means by which to block the transcription of target 
genes. For an example of the use of this method to modulate expression of 
endogenous genes see, Napoli et al., (1990 Plant Cell 2:279-289), and U.S. Pat. Nos. 
5,034,323, 5,231,020, 5,283,184 and 5,942,657, each of which publications are 
5 incorporated herein by reference. For sense suppression, the introduced sequence, 
needing less than absolute identity, also need not be full length, relative to either the 
primary transcription product or fully processed mRNA. This may be preferred to 
avoid concurrent production of some plants which are overexpressers. A higher 
identity in a shorter than full length sequence may compensate for a longer, less 
10 identical sequence. Furthermore, the introduced sequence need not have the same 
intron or exon pattern, and identity of non-coding segments will be equally effective. 
Normally, a sequence of the size ranges noted above for antisense regulation is used. 

More recently, a new method of suppressing the expression of a target gene 
has been developed. This method involves the introduction into a host cell of an 
15 inverted repeat transgene that directs the production of mRNAs that self-anneal to 
form double stranded (ds) RNA structures (Vionnet etal., 1998 Cell 95:177-187; 
Waterhouse et al., 1998 Proc. Natl. Acad Sci. USA 95:13959-13964; Misquitta et al., 
1999 Proc. Natl. Acad. Sci. USA 96:1451-1456; Baulcombe, 1999 Current Opinion 
Plant Biol 2:109-113; Sharp, 1999 Genes and Develop. 13:139-141). The ds RNA 
20 molecules, in a manner not understood, interfere with the post transcriptional 
expression of endogenous genes that are homologous to the dsRNA. It has been 
shown that the region of dsRNA homology must contain a region that is homologous 
to an exon portion of the target gene. Thus, the dsRNA may include sequences that 
are homologous to noncoding portions of the target gene. Alternatively, gene 
25 suppressive dsRNA could also be produce by transforming a cell with two different 
transgenes, one expressing a sense RNA and the other a complementary antisense 
RNA. 

A construct containing an inverted repeat of a transcribed sequence of a target 
gene can be made, for example, by following the guidance provided by Waterhouse 

30 et al.(1998), supra. The inverted repeat part of the construct comprises about 200 to 
1500 bp of transcribed DNA repeated in a head to head or tail to tail arrangement. 
The repeats are separated by about 200 to 1500 bp of non repeated DNA which can 
also be part of the transcribed region of the target gene, or can be from a different 
gene, and perhaps contain an intron. A suitable inverted repeat construct may be 

35 made by attaching in the following order: a plant promoter; a 3* region from a target 
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cDNA oriented in the "sense" orientation; a 5' region from the target cDNA; the same 
3' region of the target cDNA coding sequence but oriented in "anti-sense" orientation; 
and finally a polyA addition signal. The transcribed RNA resulting from introduction 
of the inverted repeat transgene into a target plant will have the potential of forming 
an internal dsRNA region containing sequences from the target gene that is to be 
suppressed. The dsRNA sequences are chosen to suppress a single, or perhaps 
multiple, target gene(s). In some cases, the sequences with the potential for dsRNA 
formation may originate from two or more related, target genes (e.g., members of a 
gene family). 

An additional strategy suitable for suppression of target gene activity entails 
the sense expression of a mutated or partially deleted form of the protein encoded by 
the target gene according to general criteria for the production of dominant negative 
mutations (Herskowitz I, Nature 329: 219-222 (1987)). Examples of strategies that 
produced dominant negative mutations are provided (Mizukami, 1996; Emmler, 
1995; Sheen, 1998; and Paz-Ares, 1990). 

Wild-type target gene function can also be eliminated or diminished by using 
DNA regions flanking the target gene to mediate an insertional disruption of the 
target gene coding sequence (Miao etal., 1995; Plant J. 7:359-365; Kempin etal., 
1997 Nature 389:802-803). The targeted gene replacement is mediated by 
homologous recombination between sequences in a transformation vector that 
includes DNA regions flanking the target gene and the corresponding chromosomal 
sequences. A selectable marker, such as kanamycin, bar or pat, or a screenable 
marker, such as beta-glucuronidase (GUS), is included in between the target gene 
flanking regions. These markers facilitate the identification of cells that have 
undergone target gene replacement. 

The following examples merely illustrate the best mode now contemplated for 
practicing the invention, but should not be construed to limit the invention. 

EXAMPLE 1 

Isolation of a cDNA Molecule Encoding a l-deoxv-D-xylulose-5-phosphate 
Reductoisomerase from Peppermint (Mentha piperita) 
A cDNA library was constructed from mRNA from isolated peppermint oil 
gland secretory cells, a cell type highly specialized for essential oil biosynthesis 
(Lange, B.M., and Croteau, R. (1999) Curr. Opin. Plant Biol. 2:139-144 (1999)). 
Based on likely conserved regions of the reductoisomerase gene, PGR primers were 
designed (PI, S'-CGAGATTATGCCAGGAGAGC^* (SEQ ID NO:3); P2, 
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S'-GGCTTCAGGCAAACCCTTG-SXSEQ ID NO:4)) and employed with peppermint 
oil gland library cDNA as template to amplify a 223 bp fragment designated 
pMPDXRl (SEQ ID NO:5) with significant homology to the E. coli 
reductoisomerase gene. By screening the peppermint oil gland cDNA library (2.5 x 
5 10 4 plaques) with a labeled probe derived from pMPDXRl (SEQ ID NO:5), five full- 
length clones were obtained, including the cDNA molecule having the nucleic acid 
sequence set forth in SEQ ID NO:l. 

EXAMPLE 2 

Isolation of a cDNA Molecule Encoding a l-deoxv-D-xvlulose-5-phosphate 
10 Reductoisomerase from Arabidopsis thaliana 

2 x 10 4 plaques of an A. thaliana flower bud cDNA library (CD4-6 from the 
Arabidopsis Biological Resource Center (http://aims.cps.msu.edu/aims/)) were 
screened with pMPDXRl (SEQ ID NO:5) and afforded 20 positive clones, such as 
the cDNA molecule having the sequence set forth in SEQ ID NO:6, all of which were 
15 slightly 5'-truncated. The conditions for screening the A. thaliana flower bud cDNA 
library were: hybridization in 5 X SSC at 65°C for 16 hours, followed by two washes 
in 2 X SSC at room temperature for 20 minutes per wash, then one wash in 1 X SSC 
at 55°C for 30 minutes. 

EXAMPLE 3 

20 Functional expression of l-deoxv-D-xylulose-5-phosphate reductoisomerases from 

Peppermint (Mentha piperita) and E. coli 
An additional primer set (P3, 

5 , -GTCTCAACTCTGGAAGCTTTATGAAGCAACTCTCAC-3 , ; (SEQ ID NO:8) 
and P4, 5-CTCTGTAGCCGGACCTAGGTCAGCTTGCGAGAC-3' (SEQ ID 
25 NO:9)) was employed to amplify a full-length E. coli reductoisomerase gene, and the 
resulting amplicon was inserted into pBluescript KS(-) for use as a positive control in 
the functional expression of the enzyme. 

The full-length peppermint 1 -deoxy-D-xylulose-5-phosphate 
reductoisomerase cDNA (designated pMPDXRl 8 (SEQ ID NO:l)) and the E. coli 
30 reductoisomerase clone (pECDXR20) were evaluated by expression in E. coli for the 
ability to catalyze the rearrangement and pyridine nucleotide-dependent reduction of 
l-deoxy-D-xylulose-5-phosphate to 2-C-methyl-D-erythritol-4-phosphate. 

FIGURE 2 shows GC-MS analysis of (A) the trimethylsilyl ether derivative 
of the dephosphorylated biosynthetic product (R t = 7.1 ±0.1 min) generated by 
35 recombinant peppermint l-deoxy-D-xylulose-5-phosphate reductoisomerase (SEQ ID 
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NO:2), and (B) the trimethylsilyl ether derivative of authentic 2-C-methyl-D,L- 
erythritol (R { = 7.1 ± 0.1 min) identically prepared. The slight difference in relative 
intensity of ions in the m/z 1 16, 13 1, 147 cluster in spectrum A is due to background 
subtraction of contaminants in the case of the biosynthetic product for which the total 
5 ion abundance was tenfold less than for the standard (B). For enzyme preparation, 
transformed E. coli cells were grown to A 600 of 0.5 at 37°C in 50 ml of Luria-Bertani 
medium supplemented with appropriate antibiotics. Cells harboring the peppermint 
l-deoxy-D-xylulose-5-phosphate reductoisomerase cDNA (SEQ ID NO:l) were then 
incubated at 20°C for 2 h, induced with 0.1 mM IPTG, and maintained at 20°C for 
10 15 h. Cells harboring a nucleic acid sequence encoding 

l-deoxy-D-xylulose-5-phosphate reductoisomerase from E.coli were similarly 
induced, but with 1 mM IPTG, and maintained at 37°C for 5 h. Bacteria were 
harvested by centrifugation, washed with 1 ml of assay buffer (0.1 M Tris/HCl 
(pH 7.5) containing 2 mM MnCl 2 and 0.5 mM NADPH), resuspended in 1 ml of 
15 assay buffer, and then disrupted by brief sonication at 0-4°C. The resulting 
homogenates were centrifuged to pellet debris, and an aliquot (15^1) of each 
preparation was incubated with 0.1 mmol [l- I4 C]deoxyxylulose phosphate 
(18.5 kBq) for 10 min at 23°C. To the reaction mixtures, 50 \i\ of 10 mM NaHC0 3 
was added, the suspensions were filtered through Nanosep columns (Pall Filtron; 
20 30,000 kDa cut-off), and the filtrates were analyzed by modification of an established 
reversed-phase ion-pair radio-HPLC method (McCaskill, D. and Croteau, R., Anal 
Biochem. 215: 142-149 (1998)) using 10 mM tetrabutylammonium acetate as ion- 
pairing reagent. Enzyme assays from both sources revealed the presence of a new 
radiolabeled product at R| = 34.0 min, which was isolated by semipreparative HPLC 
25 as above. Following solvent removal under vacuum, the residual material was 
dissolved in 50 ^1 of 0.1 M potassium phosphate buffer (pH 5.0) to which 10 units of 
wheat germ acid phosphatase were added (Sigma) followed by incubation at 23°C for 
2 h. The reaction was terminated by addition of 50 fil of acetone, followed by 
centrifugation, transfer of the supernatant and removal of solvent under vacuum. The 
30 residual material was dissolved in 20 ^1 of anhydrous diethyl ether and converted to 
the trimethylsilyl ether derivative for GC-MS analysis as previously described 
(Lange, B.M. et al., Proc. Nad Acad Sci. USA 95, 2100-2104 (1998)). The mass 
spectra of the products derived from the recombinant peppermint i-deoxy-D- 
xylulose-5-phosphate reductoisomerase (SEQ ID NO:2) and E. coli reductoisomerase 
35 were identical. 
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The derivatized product from both sources exhibited the same retention time 
(7.1±0.1min) and mass spectrum as an authentic sample of 
2-C-methyl-D,L-erythritol identically derivatized (FIGURE 2B), thereby confirming 
the identity of the plant l-deoxy-D-xylulose-5-phosphate reductoisomerase (SEQ ID 
NO:2) and indicating that the plant enzyme (SEQ ID NO:2) is active in the preprotein 
form. 

The reaction catalyzed by ketol acid reductoisomerase, which the reaction 
catalyzed by l-deoxy-D-xylulose-5-phosphate reductoisomerase resembles, obeys an 
ordered mechanism in which NADPH and the metal ion cofactor bind first, followed 
by the acetohydroxy acid substrate (Chunduru, S.K. et aL, Biochemistry 28, 486-493 
(1989)). Since NADPH and manganese (or magnesium) are also required for the 
enzymatic conversion of deoxyxylulose phosphate to methylerythritol phosphate (no 
intermediates, such as methylerythrose phosphate, were observed in the presence or 
absence of these cofactors), a similar reaction mechanism may be postulated for 
deoxyxylulose phosphate reductoisomerase. 

EXAMPLE 4 

Sequence Analysis of l-deoxv-D-xvlulose-5-phosphate reductoisomerase from 
Peppermint (Mentha piperita) 
The peppermint 1 -deoxy-D-xyIulose-5-phosphate reductoisomerase cDNA 
(SEQ ID NO:l) contains an open reading frame of 1425 bp encoding a protein of 
475 deduced amino acid residues (SEQ ID NO:2). The first 73 amino acids display 
typical characteristics of plastidial targeting sequences (von Heijne, G.et al., Eur. J. 
Biochem. 180, 535-545 (1989)), consistent with the subcellular localization of this 
enzyme in plant plastids where the mevalonate-independent pathway operates 
(Schwarz, M.K. (1994) Ph.D. Thesis, Eidgenossische Technische Hochschule, 
Zurich, Switzerland; Lichtenthaler, H.K., Schwender, J., Disch, A., and Rohmer, M. 
(1997) FEBS Lett. 400, 271-274.). When the residues defining the putative transit 
peptide are excluded, the size of the mature enzyme is estimated at about 43.5 kDa. 
Alignment of translated sequences (devoid of plastidial targeting peptides where 
appropriate) reveals significant homology between the peppermint 1-deoxy-D- 
xyIulose-5-phosphate reductoisomerase (SEQ ID NO:2) and the putative 1-deoxy-D- 
xylulose-5-phosphate reductoisomerase fragment from A. thaliana (SEQ ID NO:7) 
(88.0% similarity/84.2% identity), as well as with SLL0019 from the cyanobacterium 
Synechocystis sp. PCC6803 (72.3/63.7%), BG13409 from Bacillus subtilis 
(56.9/45.5%), the reductoisomerase of E. colt (53.4/43.0%), HI0807 from 
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HaemopMus influenzae (55.5/41.8%), Rv2870c from Mycobacterium tuberculosis 
(52.8/43.6%), and HP0216 from Helicobacter pylori (50.7/38.1%). The peppermint 
cDNA (SEQ ID NO:2) also shows significant homology to a highly truncated, 
Arabidopsis cDNA fragment of unknown function (SEQ ID NO: 10), deposited in the 
5 Genbank database as Accession Number T43949, and to the exon portions of an 
Arabidopsis genomic clone of unknown function (SEQ ID NO:l 1), deposited in the 
Genbank database as Accession Number AB009053. Note that SEQ ID NO:l 1 sets 
forth the sequence of the negative (i.e., non-coding) strand of the Arabidopsis 
genomic clone. 

10 Although the reaction mechanism of l-deoxy-D-xylulose-5-phosphate 

reductoisomerase and of ketol acid reductoisomerase, which catalyzes the 
rearrangement and reduction of 2-acetolactate to 2,3-dihydroxyisovalerate and of 
2-aceto-2-hybroxybutyrate to 2,3-dihydroxy-3-methylvalerate in the biosynthesis of 
branched-chain amino acids (Mrachko, G.T. et aL, Arch. Biochem. Biophys. 294, 

15 446-453 (1992)), share some similarity, the deduced amino acid sequences of these 
enzymes are quite distinct (-35% similarity). However, the N-terminus of the 1- 
deoxy-D-xylulose-5-phosphate reductoisomerase sequences contains a conserved 
motif (GSTGSIG)(SEQ ID NO: 12) with some homology to the signature sequence of 
the proposed NADPH binding site of ketol acid reductoisomerase 

20 (GXGXXGXXXG)(SEQ ID NO:13) (Rane, M.J., and Calvo, K.C., Arch Biochem. 
Biophys. 338, 83-89(1997)). 

The isolation of cDNAs encoding both deoxyxylulose phosphate synthase 
(Lange, B.M. et al., Proc. Natl Acad Set USA 95, 2100-2104 (1998)) and 1-deoxy- 
D-xylulose-5-phosphate reductoisomerase (SEQ ID NO:l) from peppermint provides 

25 substantial evidence for the operation of similar catalytic machinery in the 
pyruvate/glyceraldehyde-3-phosphate pathway in plant plastids and several 
eubacteria. Since this essential pathway is present in plants and bacteria but 
apparently not in animals, both the synthase and reductoisomerase are targets for the 
development of novel classes of highly specific herbicides, antimalarials (Jomaa, D. 

30 et al., Science 285:1573-1576 (1999) and antibiotics (Kuzuyama, T. et al., 
Tetrahedron Lett, 39, 7913-7916 (1998)). Whereas deoxyxylulose phosphate serves 
as the precursor for the biosynthesis of thiamin (Julliard, J.H., and Douce, R. Proc. 
Natl. Acad Sci. USA 88, 2042-2045 (1991)) and probably pyridoxol (Hill, R.E. et al., 
J. Biol Chem. 271, 30426-30435 (1996)) in higher plants, as well as isopentenyl 

35 diphosphate (McCaskill, D., and Croteau, R., Tetrahedron Letts. 40, 653-656 
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(1999)), the l-deoxy-D-xylulose-5-phosphate reductoisomerase catalyzes the first 
committed step in the conversion of this common intermediate to plastidial 
isoprenoids, including carotenoids and the prenyl side-chains of chlorophyll and 
plastoquinone (Bouvier, F. et al., Plant Physiol 117,1423-1431 (1998)). This 
specific transformation may be expected to be a regulated (and potentially rate- 
limiting) step of isoprenoid biosynthesis in plastids. 

EXAMPLE 5 

Physical Properties of Presently Preferred l-deoxv-D-xvIulose-5-phosphate 
Reductoisomerase Proteins of the Present Invention 
Table 1 sets forth physical properties of presently preferred 
l-deoxy-D-xylulose-5-phosphate reductoisomerase proteins of the present invention. 
Table 1 



Native Molecular Weight of 
Monomeric protein (excluding transit 
peptide) 


40,000 to 45,000 


pi 


5.5 to 6.0 


pH optimum 


7.0 to 8.0 


Cofactor Utilization 


Requires divalent metal cation 
(e.g., Mn 2+ , Mg 2+ ) and a reduced 
pyridine nucleotide (NADPH or 
possibly NADH) 



EXAMPLE 6 



Hybridizati on of a Portion of the Peppermint (Mentha x piperita) 
l-deoxv-P- xvlulose-5-phosphate Reductoisomerase cDNA (SEP ID NO:l) to Other 
Nucleic Acid Sequences of the Present Invention 
The portion of the peppermint l-deoxy-D-xylulose-5-phosphate 
reductoisomerase cDNA clone (SEQ ID NO:l) extending from nucleotide 230 to 
nucleotide 1496, and its complementary nucleic acid strand, were radiolabeled and 
used to probe a filter bearing RNA samples isolated from the following plants: 
Arabidopsis thaliana leaf tissue; tomato (Lycopersicon esculentum) leaf tissue; corn 
(Zea mays) leaf tissue; and Grand fir (Abies grandis) needles. Hybridization and 
washing were conducted by utilizing the technique of hybridizing radiolabelled 
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nucleic acid probes to nucleic acids immobilized on nitrocellulose filters or nylon 
membranes as set forth at pages 9.52 to 9.55 of Molecular Cloning, A Laboratory 
Manual (2nd edition), J. Sambrook, E.F. Fritsch and T. Maniatis eds, the cited pages 
of which are incorporated herein by reference. Hybridization was in 3 X SSC at 65°C 
for 16 hours, followed by two washes in 2 X SSC at 23°C for 20 minutes per wash, 
followed by one wash in 0.5 X SSC at 55°C for 30 minutes. 

A single mRNA band was detected in each RNA sample in the predicted 1.7 
to 2.0 kb size range. The predicted size of the mRNAs corresponding to the cloned 
peppermint (SEQ ID NO:l) and Arabidopsis (SEQ ID NO:6) cDNAs is 
approximately 1.7kb. These results demonstrate that the sequences of mRNA 
molecules encoding l-deoxy-D-xylulose-5-phosphate reductoisomerase are highly 
conserved amongst a broad range of phylogenetically distant plant species. 

While the preferred embodiment of the invention has been illustrated and 
described, it will be appreciated that various changes can be made therein without 
departing from the spirit and scope of the invention. 
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The embodiments of the invention in which an exclusive property or privilege 
is claimed are defined as follows: 

1. . An isolated nucleic acid molecule that hybridizes under stringent 
conditions to the nucleic acid molecule of SEQ ID NO:l, or to the complement of the 
nucleic acid molecule of SEQ ID NO:l, provided that said isolated nucleic acid 
molecule does not consist of a nucleic acid sequence selected from the group 
consisting of SEQ ID NO: 10 and SEQ ID NO: 11 or a nucleic acid sequence 
complementary to a nucleic acid sequence selected from the group consisting of SEQ 
ID NO: 1 0 and SEQ ID NO: 1 1 . 

2. An isolated nucleic acid molecule of Claim 1 wherein said stringent 
conditions comprise washing in 2.0 X SSC at 50°C for 30 minutes. 

3. An isolated nucleic acid molecule of Claim 1 wherein said isolated 
nucleic acid molecule encodes a plant l-deoxy-D-xylulose-5-phosphate 
reductoisomerase protein. 

4. An isolated nucleic acid molecule of Claim 1 wherein said isolated 
nucleic acid molecule encodes an essential oil plant l-deoxy-r>xylulose-5-phosphate 
reductoisomerase protein. 

5. An isolated nucleic acid molecule of Claim 4 wherein said isolated 
nucleic acid molecule encodes a Mentha l-deoxy-D-xylulose-5-phosphate 
reductoisomerase protein. 

6. An isolated nucleic acid molecule of Claim 1 wherein said nucleic 
acid molecule' encodes a l-deoxy-D-xylulose-5-phosphate reductoisomerase protein 
comprising the amino acid sequence set forth in SEQ ID NO:2. 

7. An isolated nucleic acid molecule of Claim 1 comprising the nucleic 
acid sequence of SEQ ID NO: 1 . 

8. An isolated plant l-deoxy-D-xylulose-5-phosphate reductoisomerase 

protein. 

9. An isolated essential oil plant l-deoxy-D-xylulose-5-phosphate 
reductoisomerase protein of Claim 8. 

10. An isolated Mentha l-deoxy-D-xylulose-5-phosphate 
reductoisomerase protein of Claim 8. 

U. An isolated Mentha 1 -deoxy-D-xylulose-5 -phosphate 
reductoisomerase protein of Claim 8, said protein comprising the amino acid 
sequence set forth in SEQ ID NO:2. 
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12. A replicable vector comprising a first nucleic acid molecule that 
hybridizes under stringent conditions to a second nucleic acid molecule consisting of 
the nucleic acid sequence set forth in SEQ ID NO:l, or to a third nucleic acid 
molecule consisting of the complement of the nucleic acid sequence set forth in SEQ 
ID NO:l, provided that said first nucleic acid molecule does not consist of a nucleic 
acid sequence selected from the group consisting of SEQ ID NO: 10 and SEQ ID 
NO:l 1 or a nucleic acid sequence complementary to a nucleic acid sequence selected 
from the group consisting of SEQ ID NO:l 0 and SEQ ID NO: 1 1 . 

13. A replicable vector of Claim 12 wherein said first nucleic acid 
molecule encodes a plant l-deoxy-D-xylulose-5-phosphate reductoisomerase protein. 

14. A replicable vector of Claim 12 wherein said first nucleic acid 
molecule encodes a Mentha l-deoxy-D-xylulose-5-phosphate reductoisomerase 
protein. 

15. A replicable vector of Claim 12 wherein said first nucleic acid 
molecule encodes a l-deoxy-r>xylulose-5-phosphate reductoisomerase protein 
comprising the amino acid sequence set forth in SEQ ID NO:2. 

16. A replicable vector of Claim 12 wherein said first nucleic acid 
molecule comprises the nucleic acid sequence set forth in SEQ ID NO:l, or the 
complement of the nucleic acid sequence set forth in SEQ ID NO:l. 

17. A host cell comprising a vector of Claim 12. 

18. A host cell comprising a vector of Claim 1 6. 

1 9. A host cell of Claim 1 7 wherein said host cell is a plant cell. 

20. A host cell of Claim 1 8 wherein said host cell is a plant cell. 

21. A method of enhancing the level of expression of 
l-deoxy-D-xylulose-5-phosphate reductoisomerase protein in a host cell comprising 
introducing into said host cell a replicable expression vector comprising a nucleic 
acid molecule that encodes a l-deoxy-D-xyIulose-5-phosphate reductoisomerase 
protein under conditions that enable expression of said protein in said host cell. 

22. The method of Claim 21 wherein said nucleic acid molecule that 
encodes a l-deoxy-D-xylulose-5-phosphate reductoisomerase protein hybridizes 
under stringent conditions to the complement of the nucleic acid molecule of SEQ ID 
NO:l, said stringent conditions comprising washing in 2.0 X SSC at 50°C for 
30 minutes. 

23. A method of reducing the level of expression of 
l-deoxy-D-xylulose-5-phosphate reductoisomerase protein in a host cell comprising 
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introducing into said host cell a replicable expression vector comprising a nucleic 
acid molecule that expresses an RNA molecule that hybridizes under stringent 
conditions to the nucleic acid sequence of SEQ ID NO: 1 . 

24. The method of Claim 23 wherein said stringent conditions comprise 
washing in 2.0 X SSC at 50°C for 30 minutes. 
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SEQUENCE LISTING 

<110> Croteau, Rodney B 
Lange, Bernd M 

<120> l-DEOXY-D-XYLULOSE-5-PHOSPHATE REDUCTOISOMERASE, AND 
METHODS OF USE 

<130> WSUR14982 

<140> 
<141> 

<150> 60/118,349 
<151> 1999-02-03 

<160> 13 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 1759 
<212> DNA 

<213> Mentha piperita 

<220> 

<221> CDS 

<222> (72) . . (1496) 

<400> 1 

agaaagcacc tttctatttt cttcagcttt ctgcacattt gagcttgtga ttaaccatgg 60 

ctctaaactt g atg get eta aac ttg atg get cca act gaa ate aag act 110 
Met Ala Leu Asn Leu Met Ala Pro Thr Glu He Lys Thr 
1 5 10 

etc tct ttc ttg gat age tec aaa teg aat tac aat etc aat cct etc 158 
Leu Ser Phe Leu Asp Ser Ser Lys Ser Asn Tyr Asn Leu Asn Pro Leu 
15 20 25 

aag ttc caa ggt gga ttt get ttt aag agg aag gat agt aga tgc act 206 
Lys Phe Gin Gly Gly Phe Ala Phe Lys Arg Lys Asp Ser Arg Cys Thr 
30 35 40 45 

get gca aag aga gtc cat tgc tea gca cag tea cag tea ccg cct ccg 254 
Ala Ala Lys Arg Val His Cys Ser Ala Gin Ser Gin Ser Pro Pro Pro 
50 55 60 

get tgg ccc gga egg get ttt ccc gag ccc ggt cgt atg act tgg gag 302 
Ala Trp Pro Gly Arg Ala Phe Pro Glu Pro Gly Arg Met Thr Trp Glu 
65 70 75 

ggc ccg aag ccc att tea gtt att ggc tec act ggc tec att gga act 350 
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Gly Pro Lys Pro He Ser Val He Gly Ser Thr Gly Ser He Gly Thr 
80 85 90 

cag acg etc gac ata gtt get gaa aat ccg gat aaa ttt aga ate gtc 398 
Gin Thr Leu Asp He Val Ala Glu Asn Pro Asp Lys Phe Arg He Val 
95 100 105 

gca ctt gca get ggt tea aat gtc acc etc ctt get gat cag aag get 446 
Ala Leu Ala Ala Gly Ser Asn Val Thr Leu Leu Ala Asp Gin Lys Ala 
HO 115 120 125 

ttc aaa cct aaa tta gta tea gta aaa gac gag teg tta att agt gag 494 
Phe Lys Pro Lys Leu Val Ser Val Lys Asp Glu Ser Leu He Ser Glu 
130 135 140 

etc aaa gaa get ctg get ggt ttc gaa gat atg cct gaa att att cca 542 
Leu Lys Glu Ala Leu Ala Gly Phe Glu Asp Met Pro Glu He He Pro 
145 150 155 

gga gag cag ggg atg ate gag gtt get cgc cat cca gat get gtt act 590 
Gly Glu Gin Gly Met He Glu Val Ala Arg His Pro Asp Ala Val Thr 
160 165 , 170 

gta gta acg gga att gtc ggc tgt gca ggt ttg aag ccg aca gtg get 638 
Val Val Thr Gly He Val Gly Cys Ala Gly Leu Lys Pro Thr Val Ala 
175 180 185 

gec ata gaa get gga aag gac att get ttg gec aat aaa gag aca eta 686 
Ala He Glu Ala Gly Lys Asp He Ala Leu Ala Asn Lys Glu Thr Leu 
190 195 200 205 

ate get gga ggg cct ttt gtc ctt cct ctt gca aag aag cac aac gtc 734 
He Ala Gly Gly Pro Phe Val Leu Pro Leu Ala Lys Lys His Asn Val 
210 215 220 

aag att ctt cct gca gac tec gaa cat tct get ata ttt cag tgt ate 782 
Lys He Leu Pro Ala Asp Ser Glu His Ser Ala He Phe Gin Cys He 
225 230 235 

caa ggc ttg cca gaa ggt get ttg agg cgt ata att ttg act gca teg 830 
Gin Gly Leu Pro Glu Gly Ala Leu Arg Arg He He Leu Thr Ala Ser 
240 245 250 

gga gga get ttc agg gat ttg ccc gtt gag aaa ttg aaa gag gtg aaa 878 
Gly Gly Ala Phe Arg Asp Leu Pro Val Glu Lys Leu Lys Glu Val Lys 
255 260 265 

gta gca gat get tta aag cat tec aac tgg aat atg ggg aaa aag aat 926 
Val Ala Asp Ala Leu Lys His Ser Asn Trp Asn Met Gly Lys Lys Asn 
270 275 280 285 

aca gtg cga ctt ctg caa etc ttc ttt aac aag ggc etc gaa gtc ata 974 
Thr Val Arg Leu Leu Gin Leu Phe Phe Asn Lys Gly Leu Glu Val He 
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290 295 300 

aaa get cac tat ttg ttt ggg gca gaa tat gat gat att gag att gtt 1022 
Lys Ala His Tyr Leu Phe Gly Ala Glu Tyr Asp Asp lie Glu He Val 
305 310 315 

att cat tec cca tec ate att cac teg atg gtc gag aca cag gat tea 1070 
He His Ser Pro Ser He He His Ser Met Val Glu Thr Gin Asp Ser 
320 325 330 

teg gtg eta get caa tta gga tgg ccc gat atg cgt ttg cct att ctg 1118 
Ser Val Leu Ala Gin Leu Gly Trp Pro Asp Met Arg Leu Pro He Leu 
335 340 345 

tac acc tta tea tgg cca gag aga gtc tac tgc tec gag att aca tgg 1166 
Tyr Thr Leu Ser Trp Pro Glu Arg Val Tyr Cys Ser Glu He Thr Trp 
350 355 360 365 

cct cga etc gac etc tgc aag gtc gat tta cca ttc aag aag ccc gat 1214 
Pro Arg Leu Asp Leu Cys Lys Val Asp Leu Pro Phe Lys Lys Pro Asp 
370 375 380 

aac cgt gaa ata ccc get atg gat eta gee tat get get tgg aag age 1262 
Asn Arg Glu He Pro Ala Met Asp Leu Ala Tyr Ala Ala Trp Lys Ser 
385 390 395 

egg age acc atg acc gga gtt ctg age gca get aat gag aaa gca gtc 1310 
Arg Ser Thr Met Thr Gly Val Leu Ser Ala Ala Asn Glu Lys Ala Val 
400 405 410 

gaa atg ttc ate gac gag aaa ate ggc tac etc gac att ttc aag gtc 1358 
Glu Met Phe He Asp Glu Lys lie Gly Tyr Leu Asp He Phe Lys Val 
415 420 425 

gtg gag ctt aca tgc gac aag cat cga teg gaa atg gcg gtg teg cct 1406 
Val Glu Leu Thr Cys Asp Lys His Arg Ser Glu Met Ala Val Ser Pro 
430 435 440 445 

teg ttg gag gag ate gtt cac tac gac cag tgg gca cgc gac tac get 1454 
Ser Leu Glu Glu He Val His Tyr Asp Gin Trp Ala Arg Asp Tyr Ala 
450 455 460 

gca acg gtg ctg aaa teg gee ggt ttg agt cct get ctt gta 1496 
Ala Thr Val Leu Lys Ser Ala Gly Leu Ser Pro Ala Leu Val 
465 470 475 

tgagcagagg ttgatgcaaa tttgatcaac tggaagcttg ttcctttttc tttttttttg 1556 

ttctggtttt ccttcttact tttagggagg aagecattta ctatgaaaag gaaaggaatc 1616 

atgtgacttt gtgaaacagt cccaccatga aatagatata aaagaatcac aagattttgt 1676 

gttttatgat tttcatcaaa aagtgtaaat tttgatgtct cagattattt gtagcttaaa 1736 
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aggtgaataa acacagcagt tgg 1759 



<210> 2 
<211> 475 
<212> PRT 

<213> Mentha piperita 
<400> 2 

Met Ala Leu Asn Leu Met Ala Pro Thr Glu lie Lys Thr Leu Ser Phe 
15 10 15 

Leu Asp Ser Ser Lys Ser Asn Tyr Asn Leu Asn Pro Leu Lys Phe Gin 
20 25 30 

Gly Gly Phe Ala Phe Lys Arg Lys Asp Ser Arg Cys Thr Ala Ala Lys 
35 40 45 

Arg Val His Cys Ser Ala Gin Ser Gin Ser Pro Pro Pro Ala Trp Pro 
50 55 60 

Gly Arg Ala Phe Pro Glu Pro Gly Arg Met Thr Trp Glu Gly Pro Lys 
6 5 70 75 80 

Pro He Ser Val He Gly Ser Thr Gly Ser He Gly Thr Gin Thr Leu 
85 90 95 

Asp lie Val Ala Glu Asn Pro Asp Lys Phe Arg He Val Ala Leu Ala 
100 105 HO 

Ala Gly Ser Asn Val Thr Leu Leu Ala Asp Gin Lys Ala Phe Lys Pro 
115 120 125 

Lys Leu Val Ser Val Lys Asp Glu Ser Leu He Ser Glu Leu Lys Glu 
130 135 140 

Ala Leu Ala Gly Phe Glu Asp Met Pro Glu He He Pro Gly Glu Gin 
145 150 155 i 60 

Gly Met He Glu Val Ala Arg His Pro Asp Ala Val Thr Val Val Thr 
. 165 170 175 

Gly lie Val Gly Cys Ala Gly Leu Lys Pro Thr Val Ala Ala He Glu 
180 185 190 

Ala Gly Lys Asp He Ala Leu Ala Asn Lys Glu Thr Leu He Ala Gly 
195 200 205 

Gly Pro Phe Val Leu Pro Leu Ala Lys Lys His Asn Val Lys He Leu 
210 215 220 



Pro Ala Asp Ser Glu His Ser Ala He Phe Gin Cys He Gin Gly Leu 
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225 



230 



235 



240 



Pro Glu Gly Ala 



Leu Arg Arg lie lie Leu Thr Ala Ser Gly Gly Ala 
245 250 255 



Phe Arg Asp Leu 
260 



Pro Val Glu Lys Leu Lys Glu Val Lys Val Ala Asp 



265 270 



Ala Leu Lys His 



Ser Asn Trp Asn Met Gly Lys Lys Asn Thr Val Arg 
280 285 



275 



Leu Leu Gin Leu Phe Phe Asn Lys Gly Leu Glu Val He Lys Ala His 
290 295 300 

Tyr Leu Phe Gly Ala Glu Tyr Asp Asp He Glu He Val He His Ser 
305 310 315 320 

Pro Ser He He His Ser Met Val Glu Thr Gin Asp Ser Ser Val Leu 
325 330 335 

Ala Gin Leu Gly Trp r Pro Asp Met Arg Leu Pro He Leu Tyr Thr Leu 
340 345 350 

Ser Trp Pro Glu Arg Val Tyr Cys Ser Glu He Thr Trp Pro Arg Leu 
355 360 365 

Asp Leu Cys Lys Val Asp Leu Pro Phe Lys Lys Pro Asp Asn Arg Glu 
370 375 380 

He Pro Ala Met Asp Leu Ala Tyr Ala Ala Trp Lys Ser Arg Ser Thr 
385 390 395 400 

Met Thr Gly Val Leu Ser Ala Ala Asn Glu Lys Ala Val Glu Met Phe 
405 410 415 

He Asp Glu Lys He Gly Tyr Leu Asp He Phe Lys Val Val Glu Leu 
420 425 430 

Thr Cys Asp Lys His Arg Ser Glu Met Ala Val Ser Pro Ser Leu Glu 
435 440 445 

Glu He Val His Tyr Asp Gin Trp Ala Arg Asp Tyr Ala Ala Thr Val 
450 455 460 

Leu Lys Ser Ala Gly Leu Ser Pro Ala Leu Val 
465 470 475 



<210> 3 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<220> 

<221> misc_f eature 
<222> (1) . . (20) 
<223> PCR primer PI 

<400> 3 

cgagattatg ccaggagagc 20 

<210> 4 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<220> 

<221> miscjeature 
<222> (1) . . (19) 
<223> PCR primer P2 

<400> 4 

ggcttcaggc aaacccttg 19 

<210> 5 
<211> 270 
<212> DNA 

<213> Mentha piperita 
<400> 5 

tgaaattatt ccaggagagc aggggatgat cgaggttgct cgccatccag atgctgttac 60 
tgtagtaacg ggaattgtcg gctgtgcagg tttgaagccg acagtggctg ccatagaagc 120 
tggaaaggac attgctttgg ccaataaaga gacactaatc gctggagggc cttttgtcct 180 
tcctcttgca aagaagcaca acgtcaagat tcttcctgca gactccgaac attctgctat 240 
atttcagtgt atccaaggct tgccagaagg 270 



<210> 6 
<211> 1197 
<212> DNA 

<213> Arabidopsis thaliana 
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<220> 

<221> CDS 

<222> (1) . . (1197) 

<400> 6 

gga cca aaa ccc ate tct ate gtt gga tct act ggt tct att ggc act 48 

Gly Pro Lys Pro He Ser lie Val Gly Ser Thr Gly Ser He Gly Thr 
1 5 10 15 

cag aca ttg gat att gtg get gag aat cct gac aaa ttc aga gtt gtg 96 
Gin Thr Leu Asp He Val Ala Glu Asn Pro Asp Lys Phe Arg Val Val 
20 25 30 

get eta get get ggt teg aat gtt act eta ctt get gat cag gta agg 144 
Ala Leu Ala Ala Gly Ser Asn Val Thr Leu Leu Ala Asp Gin Val Arg 
35 40 45 

aga ttt aag cct gca ttg gtt get gtt aga aac gag tea ctg att aat 192 
Arg Phe Lys Pro Ala Leu Val Ala Val Arg Asn Glu Ser Leu He Asn 
50 55 60 

gag ctt aaa gag get tta get gat ttg gac tat aaa etc gag att att 240 
Glu Leu Lys Glu Ala Leu Ala Asp Leu Asp Tyr Lys Leu Glu He He 
65 .70 75 80 

cca gga gag caa gga gtg att gag gtt gee cga cat cct gaa get gta 288 
Pro Gly Glu Gin Gly Val He Glu Val Ala Arg His Pro Glu Ala Val 
85 90 95 

ace gtt gtt acc gga ata gta ggt tgt gcg gga eta aag cct acg gtt 336 
Thr Val Val Thr Gly He Val Gly Cys Ala Gly Leu Lys Pro Thr Val 
100 105 110 

get gca att gaa gca gga aag gac att get ctt gca aac aaa gag aca 384 
Ala Ala He Glu Ala Gly Lys Asp He Ala Leu Ala Asn Lys Glu Thr 
115 120 125 

tta ate gca ggt ggt cct ttc gtg ctt ccg ctt gee aac aaa cat aat 432 
Leu He Ala Gly Gly Pro Phe Val Leu Pro Leu Ala Asn Lys His Asn 
130 135 140 

gta aag att ctt ccg gca gat tea gaa cat tct gee ata ttt cag tgt 480 
Val Lys He Leu Pro Ala Asp Ser Glu His Ser Ala He Phe Gin Cys 
145 150 155 160 

att caa ggt ttg cct gaa ggc get ctg cgc aag ata ate ttg act gca 528 
He Gin Gly Leu Pro Glu Gly Ala Leu Arg Lys He He Leu Thr Ala 
165 170 175 



tct ggt gga get ttt agg gat tgg cct gtc gaa aag eta aag gaa gtt 
Ser Gly Gly Ala Phe Arg Asp Trp Pro Val Glu Lys Leu Lys Glu Val 
180 185 190 



576 
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aaa gta gcg gat gcg ttg aag cat cca aac tgg aac atg gga aag aaa 624 
Lys Val Ala Asp Ala Leu Lys His Pro Asn Trp Asn Met Gly Lys Lys 
195 200 205 

ate act gtg gac tct get acg ctt ttc aac aag ggt ctt gag gtc att 672 
lie Thr Val Asp Ser Ala Thr Leu Phe Asn Lys Gly Leu Glu Val lie 
210 215 220 

gaa gcg cat tat ttg ttt gga get gag tat gac gat ata gag att gtc 720 
Glu Ala His Tyr Leu Phe Gly Ala Glu Tyr Asp Asp lie Glu He Val 
225 230 235 240 

att cat ccg caa agt ate ata cat tec atg att gaa aca cag gat tea 768 
He His Pro Gin Ser He He His Ser Met He Glu Thr Gin Asp Ser 
245 - 250 255 

tct gtg ctt get caa ttg ggt tgg cct gat atg cgt tta ccg att etc 816 
Ser Val Leu Ala Gin Leu Gly Trp Pro Asp Met Arg Leu Pro He Leu 
260 265 270 

tac acc atg tea tgg ccc gat aga gtt cct tgt tct gaa gta act tgg 864 
Tyr Thr Met Ser Trp Pro Asp Arg Val Pro Cys Ser Glu Val Thr Trp 
275 280 285 

cca aga ctt gac ctt tgc aag etc ggt tea ttg act ttc aag aaa cca 912 
Pro Arg Leu Asp Leu Cys Lys Leu Gly Ser Leu Thr Phe Lys Lys Pro 
290 295 300 

gac aat gtg aaa tac cca tec atg gat ctt get tat get get gga cga 960 
Asp Asn Val Lys Tyr Pro Ser Met Asp Leu Ala Tyr Ala Ala Gly Arg 
305 310 315 320 

get gga ggc aca atg act gga gtt etc age gee gec aat gag aaa get 1008 
Ala Gly Gly Thr Met Thr Gly Val Leu Ser Ala Ala Asn Glu Lys Ala 
325 330 335 

gtt gaa atg ttc att gat gaa aag ata age tat ttg gat ate ttc aag 1056 
Val Glu Met Phe He Asp Glu Lys He Ser Tyr Leu Asp lie Phe Lys 
340 345 350 

gtt gtg gaa tta aca tgc gat aaa cat cga aac gag ttg gta aca tea 1104 
Val Val Glu Leu Thr Cys Asp Lys His Arg Asn Glu Leu Val Thr Ser 
355 360 365 

ccg tct ctt gaa gag att gtt cac tat gac ttg tgg gca cgt gaa tat 1152 
Pro Ser Leu Glu Glu He Val His Tyr Asp Leu Trp Ala Arg Glu Tyr 
370 375 380 

gee gcg aat gtg cag ctt tct tct ggt get agg cca gtt cat gca 1197 
Ala Ala Asn Val Gin Leu Ser Ser Gly Ala Arg Pro Val His Ala 
385 390 395 
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<210> 7 
<211> 399 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 7 

Gly Pro Lys Pro He Ser He Val Gly Ser Thr Gly Ser \Ele Gly Thr 
15 10 15 

Gin Thr Leu Asp He Val Ala Glu Asn Pro Asp Lys Phe Arg Val Val 
20 25 30 

Ala Leu Ala Ala Gly Ser Asn Val Thr Leu Leu Ala Asp Gin Val Arg 
35 40 45 

Arg Phe Lys Pro Ala Leu Val Ala Val Arg Asn Glu Ser Leu He Asn 
50 55 60 

Glu Leu Lys Glu Ala Leu Ala Asp Leu Asp Tyr Lys Leu Glu He He 
65 70 75 80 

Pro Gly Glu Gin Gly Val He Glu Val Ala Arg His Pro Glu Ala Val 
85 90 95 

Thr Val Val Thr Gly He Val Gly Cys Ala Gly Leu Lys Pro Thr Val 
100 105 HO 

Ala Ala He Glu Ala Gly Lys Asp He Ala Leu Ala Asn Lys Glu Thr 
115 120 125 

Leu He Ala Gly Gly Pro Phe Val Leu Pro Leu Ala Asn Lys His Asn 
130 135 140 

Val Lys He Leu Pro Ala Asp Ser Glu His Ser Ala He Phe Gin Cys 
145 150 155 160 

He Gin Gly Leu Pro Glu Gly Ala Leu Arg Lys He He Leu Thr Ala 
165 170 175 

Ser Gly Gly Ala Phe Arg Asp Trp Pro Val Glu Lys Leu Lys Glu Val 
180 185 190 

Lys Val Ala Asp Ala Leu Lys His Pro Asn Trp Asn Met Gly Lys Lys 
195 200 205 

He Thr Val Asp Ser Ala Thr Leu Phe Asn Lys Gly Leu Glu Val He 
210 215 220 

Glu Ala His Tyr Leu Phe Gly Ala Glu Tyr Asp Asp He Glu He Val 
225 230 235 240 



He His Pro Gin Ser He lie His Ser Met He Glu Thr Gin Asp Ser 
245 250 255 
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Ser Val Leu Ala Gin 
260 

Tyr Thr Met Ser Trp 
275 

Pro Arg Leu Asp Leu 
290 

Asp Asn Val Lys Tyr 
305 

Ala Gly Gly Thr Met 
325 

Val Glu Met Phe lie 
340 

Val Val Glu Leu Thr 
355 

Pro Ser Leu Glu Glu 
370 

Ala Ala Asn Val Gin 
385 



Leu Gly Trp Pro Asp Met 
265 

Pro Asp Arg Val Pro Cys 
280 

Cys Lys Leu Gly Ser Leu 
295 

Pro Ser Met Asp Leu Ala 
310 315 

Thr Gly Val Leu Ser Ala 
330 

Asp Glu Lys lie Ser Tyr 
345 

Cys Asp Lys His Arg Asn 
360 

He Val His Tyr Asp Leu 
375 

Leu Ser Ser Gly Ala Arg 
390 395 



Arg Leu Pro He Leu 
270 

Ser Glu Val Thr Trp 
285 

Thr Phe Lys Lys Pro 
300 

Tyr Ala Ala Gly Arg 
320 

Ala Asn Glu Lys Ala 
335 

Leu Asp He Phe Lys 
350 

Glu Leu Val Thr Ser 
365 

Trp Ala Arg Glu Tyr 
380 

Pro Val His Ala 



<210> 8 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
oligonucleotide 

<220> 

<221> misc_f eature 
<222> (1) . . (36) 
<223> PCR primer P3 

<400> 8 

gtctcaactc tggaagcttt atgaagcaac tctcac 



<210> 9 
<211> 33 
<212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Description of Artificial Sequence: 
oligonucleotide 

<220> 

<221> misc_feature 
<222> (1) . . (33) 
<223> PCR primer P4 

<400> 9 

ctctgtagcc ggacctaggt cagcttgcga gac 33 

<210> 10 
<211> 295 
<212> DNA 

<213> Arabidopsis thaliana 
<220> 

<221> misc_feature 
<222> (1) . . (295) 

<223> Arabidopsis EST sequence wherein n represents an 
unknown nucleic acid base 

<400> 10 

gctgatttgg actataaact cgagattatn ccaggagagc aaggagtgat tnaggttgcc 60 

cgacatcctg aagctgtaac cgttgtnacc ggaatagtag gttgtncggg actaaagcct 120 

acggttgctg caattaaagc aggaaaggac attgctcttg caaacaaaga gacattaatc 180 

gcaggtggtc ctttcgtgcn tccgcttgcn aacaaacata atgtaaaaga ttctnccggc 240 

agattcagaa cattntgnca tatttnaaat gtattcaagg gtttgcctna agccg 295 

<210> 11 
<211> 8050 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 11 

atatatatca aaccaatata ttttattatc aagtttcatt acataatgtc tcatactaaa 60 
ccaacaaaaa taaacgtcag tatatttagc atatatttac tttgtcagta taccaaccct 120 
cattgcttaa tatataatgg aaatcaatct gaagtataac ctacaagttg tacgtgtcta 180 
atagtaaacg aagtaccacc ttagataatc tgatatcaca cataatagta attaataagg 240 
ttaaattatg aaaagaatga cttgcaagtt acgatttatg ataacttaaa gaagcttttt 300 
atcataaacc gaccaattga tttcctggta catttatatt aaaacatcat tattgcaaaa 360 



WO 00/46346 



-12- 



PCT/US00/02185 



taatgagtcg acaaatcaaa acttctattg 
aatctaatgt gaaggtgttt tcctatgcta 
atgattttag cggtggcagt aggttaaaaa 
taaggagagt gcatttatat ctttatccct 
aaaaaaaaac taattgtttt taattcaagt 
ttatttcttg attgtttcaa ataatggaaa 
caaaaagtaa atttgaaaga aaaaaaaggg 
cagagcaaca aaaaccatta tcgccctcgt 
gatacgacgt ttcaagtctc tcaacgatgg 
tgactagtca ttgcagagag aaaactttct 
gaaattcatt aacaaggctc agaatttgta 
gatgatgatc aatgtttata tctctgcatc 
tggaatcctt ggtgttgatc ctttagctga 
cttagctctg ttgcttcacc cggacaagaa 
gctggtttta gatgcttggt ctctactatc 
aagagaaaac caaaacaaga aaagagcgaa 
cctgcttctt cttcttcgtc gaaaccggtg 
ttttcgacag tatgcaataa atgcacaacg 
cttaacaaga cctttccttg tccaaactgt 
tcgacagagg tgatcaatgg gaggacattc 
gaaccatcga gggccaattc tcaagcaact 
aactctactg agagtttttt caagaaacca 
catgaagctc agaggctttt caagaaccca 
catgaagctc agaggctttt caagaaccct 
aacaattaag ctcggtttta ttggtaaaaa 
gatcacagat aaattagcta cacaatccat 
ccccattctc tacactaatc ttctttcaac 



ttccaaatcg cttttgccaa acaaattatt 420 
tgactaataa tttagttaaa attattccta 480 
gagtgcattt atatcttctt ctttttttgg 540 
acgattcgta actaaatcct ttaaaaaaga 600 
tttattgccg gtattagaaa cagaaaatat 660 
ccaaaaaaaa aggaaagaga aattagtaat 720 
aaatcaccat caattaagta aacccatcgc 760 
agcttcttca gtttctcgag tcatctctaa 840 
aatgtaataa ggaagaagct aaaagagcaa 900 
gagaacgatt acattggtca ttggtgcaaa 960 
tccaacgctc gatggtttga aacaaccttt 1020 
aaacaaagaa gaaggagaat ctgactggta 1080 
tgatgaaaca gtgaagaaac attacaagac 1140 
caggtttaat ggtgcggaag gtgcgtttaa 1200 
tgataaagct aagagaattg cgttgatcaa 1260 
ccatctgctt cgtgtaataa gcctgcagag 1320 
gacatgacct tttcgacagt gagcatgacc 1380 
agatgttgtc atttttcgac gcagaatcat 1440 
ggtcagaatt cggctatgac caatatatca 1500 
atcagagtct ctgtttctcc gcaacaagaa 1560 
agcagacgta gcacacgtca tgatgatgca 1620 
atgccgacaa caggagatgc aaactctact 1680 
atgacgacaa caggagatgc gaactctact 1740 
tagatgaatg taattaatca tataatgtga 1800 
tggtttcaaa ttatcagttt ggcttgttcg 1860 
aatccttgcc aaaaacgcta ttaagtagta 1920 
atttcctcag aagcttcctt atgttcttcc 1980 
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aacaaccaat tcttcatgca tgaactggcc 
catattcacg tgcccacaag tcatagtgaa 
actcgtttcg atgtttatcg catgttaatt 
tcctgtaaac aaaagtgaga atataaacaa 
tgctcaaaac tgaaaaataa ttcttacttt 
ttggcggcgc tgagaactcc agtcattgtg 
tccatggatg ggtatttcac attgtctggt 
aaatccacaa ttgtaaacaa cttttggttt 
ggtcctaacc cagtttaact gatccacacc 
ccaaaccgaa gaccgattcg gtttcatttt 
tgaaaacaag attggggaac ttttcttggt 
tcacacttga taaacagaga gtatataaat 
tggccaagtt acttcagaac aaggaactct 
taaacgcata tcaggccaac ccaattgagc 
aatacatgtt atacagttat ttttttaaaa 
ttcagcaaga cctgtgtttc aatcatggaa 
tctatatcgt catactcagc tccaaacaaa 
ttcaaaaaat caagaactca tctaccttga 
cttaggagaa aataatctta accttgttga 
ttcccatgtt ccagtttgga tgcttcaacg 
cgacaggcca atcccttttt caaaatccag 
gagaagaaaa aaagtctatg cagagagaga 
cagatgcagt caagattatc ttgcgcagag 
gaacataaaa gaagattttt cactcaaatt 
gctgaactca atatgaaagt tgaggtactt 
tggcagaatg ttctgaatct gccggaagaa 



tagcaccaga agaaagctgc acattcgcgg 2040 
caatctcttc aagagacggt gatgttacca 2100 
ccacaacctt gaagatatcc aaatagctta 2160 
ttgtgattcg tatcaagaac ttcattgaga 2220 
tcatcaatga acatttcaac agctttctca 2280 
cctccagctc gtccagcagc ataagcaaga 2340 
ttcttgaaag tcaatgaacc gagtctgcca 2400 
taggtgctga atgctgatag ataaggcagt 2460 
aaaacagtag caaaataacc aattgcaaaa 2520 
ttatcttatc taaacaacct aaaaccaaac 2580 
gataattaaa attttcaact aagcttagct 2640 
gtggttagct tacttgcaaa ggtcaagtct 2700 
atcgggccat gacatggtgt agagaatcgg 2760 
aagcacagat gaatcctgtg gaacaaaaca 2820 
ccggaaaaat aataatttag ttagtaatgt 2880 
tgtatgatac tttgcggatg aatgacaatc 2940 
taatgcgctt caatgacctc aagaccctgt 3000 
tcaaaggtat tttcaaaatc agagtttaac 3060 
aaagcgtagc agagtccaca gtgattttct 3120 
catccgctac tttaacttcc - tttagctttt 3180 
tgaaaagttt ccattaacca aacgagaatt 3240 
agaatatcga aacaaaccta aaagctccac 3300 
cgccttcagg caaaccttga atacactaga 3360 
gccagaggtt gaacttgcat taagaccaac 3420 
aattctatgt gatttgtgat acctgaaata 3480 
tctttacatt atgtttgttg gcaagcggaa 3540 
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gcacgaaagg accacctgcg attaatgtct 
cttcaattgc agcaaccgta ggctgcagta 
ctttcttttt tcatatcctc ttaataaggt 
tagtcccgca caacctacta ttccggtaac 
ctgttgatga acataataag taaaaaccta 
ctaacctcaa tcactccttg ctctcctgga 
aaagcctctt taagctcatt aatcagtgac 
aatctcctta cctgccacca ttcaaaatag 
gagattgcag aagcaaaagc ctaaaccaga 
aacgagttaa tactatcttg cttatgatac 
caagtggtct gaatgacaaa ttggagagac 
acttacctga tcagcaagta gagtaacatt 
tttgtcagga ttctcagcca caatatccaa 
aaatgatgca acaataactc agtaagaaaa 
cataagacaa acttaaagtc tggtcatact 
ataaaacctg agtgccaata gaaccagtag 
cccaagattg acgaggcgcc tcagggacag 
gttgetgctg cactttcact gaacacttaa 
tcctcctcaa actaaaccca cctgtgaaac 
gacctaaagc aaaccaaaaa aaatcgaatt 
attcacaaga gcctaagaca actaatgaaa 
ccaaggagga ggaaagaaga gaggaagaag 
aacctggagg tatccaagaa agaaatagct 
gtcatcatca gagtctttta aaaatcgaat 
attatcagag aagacgaatc agataaacag 
aatctggatt tgaatggtac ccaacagact 
tttagtaaca aggacctttt tattaaggta 



ctttgtttgc aagagcaatg tcctttcctg 3600 
aaaataagca acaagcttta tcatctgcaa 3660 
ttaataacaa aaaattagag tatatacctt 3720 
aacggttaca gcttcaggat gtcgggcaac 3780 
tctacactac aatcaaaact aacaaatgaa 3840 
ataatctcga gtttatagtc caaatcagct 3900 
tcgtttctaa cagcaaccaa tgcaggctta 3960 
aatcacagaa ccatactata gagatttctt 4020 
acctgatttc tctggtttga tctgatacat 4080 
taccactgaa ctgagaatta aactgaattc 4140 
tcaatactaa tttttttaca aatgaagcca 4200 
cgaaccagca gctagagcca caactctgaa 4260 
tgtctgcaaa atggaagttc ttgtcgataa 4320 
aaatatcatt cttctatgag tctagtcatt 4380 
caagaactgc acaataatgc cttaatcgaa 4440 
atccaacgat agagatgggt tttggtccat 4500 
ctctcccagg ccatgctgga ggaggttgtt 4560 
caccttttcc aaaacctctc ccttgattcc 4620 
actccaaaga tgtaaaattt aaaactctac 4 680 
gaagaaataa cagattacct agatagagaa 4740 
gtttgcaact ttaatcgaaa agagagttga 4800 
aagaaacctg agagtttagg gattggattg 4860 
ttggattcag ctggagatag tgagtttaat 4920 
attttccaga gaaccgcact actactcttg 4980 
tgtgagagag agagatgatg ataagaaagg 5040 
tttgtcattt tttaaagatt tcgctgagca 5100 
acgacaactt gtaagtggta aataatccag 5160 



WO 00/46346 



-15- 



PCT/US00/02185 



tcttactatg ttcccatttt ctatttgatt 
tcatcaatta tatagtttgt caaatataat 
aatggttaag gatttctctc ttacaaaata 
attatgaatt tttgatatga atatcttaaa 
ctgtcttttt caaaaataaa acatgttaca 
tttttttata aagtacatgt tatatgctgt 
ttagatcttt gacaagtata taatatactt 
ttcactatca ttcttttttt tttgtcaaca 
tgttcctcaa tgttcaattt gtaaatttaa 
tgaatttttt acgtatataa ttctctatat 
tttaaataaa attagtcttc ttgtagacta 
gtttgaatgg tgctctcttt tctttcttcg 
aaagaaaaag aaaaaagata atttacttta 
tatcacatta catagtgttt tcgtggggat 
agataatggt atgttggtat tggtagatga 
tcatctgagg acaagtgttg tacgttaagt 
aaacaagtgt tacttgctgc atccactcaa 
tctttaaaca tcggaaatcg gagcctgaat 
taattacggt gtagccatct ctccaattcc 
ggaggatagc aactctcacc cgcaaaatca 
aaagaagcaa cgtatggaga atgaaacacg 
tgactgtccg gtttgcttcg agccgctcac 
ttgcatgcat tttattttgt ttcatgtgac 
attgaatacg gctttgattg tatctcgttt 
acatatagtt tgcaattttt gctttgccaa 
tgatttaccc attggtaata agcgatgctt 



tctttagagt attaaacagc agaatctgta 5220 
tattattaga aatatgcatt acaagggatt 5280 
aaaaagaaaa agtttatggt attcgttcgt 5340 
ttgaatatgt tttgactaac atgttgtatg 5400 
tgtttttttt ttcttcttct cttttttttt 5460 
aacaattata atccaaatgt caaacttagt 5520 
ttctttttaa aaattatgta ttgaatattt 5580 
tttttcacta tcattcttat ttctttgata 5640 
atttcaaaag ccatgtaact ttaaccaact 5700 
ctctaattag agtcatgtta ggttcgattg 5760 
ttagatcatc cgttcaaaaa gattattgtt 5820 
gaaaggaata aaatttatcc cataaaaaga 5880 
tttaagtgtg attaagctgt tatgattgac 5940 
acagagatca atagataaat gataatggta 6000 
gtcagtaaat catttactac tgctaatgga 6060 
gacacatggc aaaacagtga aagagacgtt 6120 
attccatccc aagtcatgca tgcaactttt 6180 
taatgcgtta actaatggaa acaaaaacca 6240 
gattccattt caagttaacc ttatcgatat 6300 
aacatcaaaa agaaaaagct ctcacccgca 6360 
atcggctaag ttgttggatc ttgatgttct 6420 
tattcctacc tttcaggtta tgttttgaac 6480 
attttgattt cgcttttgtt aatttatttt 6540 
ggtatattat gcgtttcagt gtgatgatgg 6600 
agtgagtaac aagtgccctg gtcctgggtg 6660 
cgcaatggag agggttctcg aatcagcctt 6720 
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tgttccatgt caaaatactg agtttggctg cacaaaaagt gtctcttatg aaaaagtgtc 6780 
aagtcacgaa aaggaatgca actactctca atgctcttgc cctaacctcg aatgcaatta 6640 
cactggctca tataacatca tctacggtca ctttatgcgt cgccatcttt acaatagtac 6900 
gatcgtttcc tccaaatggg gatattccac tgttgatgtt ctaataaaca tcaaagaaaa 6960 
ggtttcagtt ctctgggaat ctcgtcagaa acttttgttt gtagttcagt gtttcaagga 7020 
gcgacatggt gtttatgtta ctgttagacg catcgcacca cctgcttcag aattcaagaa 7080 
gttctcgtat cgtctttcgt atagtatcga cggacataat gttacttacg aatcaccaga 7140 
agtaaagagg cttcttgaag tgaattctca aatccctgat gacagtttca tgtttgtccc 7200 
taactgttta ctgcatggtg aaatgttgga gttgaagctt ggcatcaaga agttgaaaca 7260 
aacgtaacta gatctagttt ggtttggggt tacgaggcgt tctgttttgt tgtgtttgtt 7320 
ttaattctct gtttaagaac ctttgtactt ttgtagtagc ccactcttga atttattgat 7380 
gttgttgttt tgagttagtt gtataatcca aaagctttct ggtttggttc ccggttcggt 7440 
tttgtacata gtaggatttt taataaagcc tgctaatgag gttcagcaag ttaccattgc 7500 
tcaggaaact gttatggagg atcctccaac gtctctgttt aagaattcag taccaattcg 7560 
agaggatcaa attcagaacg ctatcacaaa ttccattcgc taatcttaga attgggcata 7620 
aattctggaa taatgggctc atttggtatt agcgtccata cacattgtag gcccaataaa 7680 
ataatagacc aagaaaaaac taaaaaccgg acaacgccgt tatctcttct tcgtgtgacc 7740 
accacacata catacatacc actcaccgta ccaaaaagat tagaccaaca aaaaaaaaaa 7800 
aaaaaggacc agctcagatg agtctggagt ttccaagttt aaaacctctc tacctcgatt 7860 
tgagcaaatc ctgatttact ctcatcctca tcatctctca tcatcgagat tcatagtctc 7920 
ttttgccgct tggattcttc caaggttagt gagctgctat ggcaactcat cagcaaacgc 7980 
aacctccttc cgattttccc gctcttgccg atgaaaattc ccagattcca ggttcaattt 8040 
acaccctcta 



8050 



<210> 12 
<211> 7 
<212> PRT 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: conserved 
motif 

<220> 

<221> PEPTIDE 
<222> (1) . . (7) 

<223> Conserved amino acid sequence motif at N terminus 
of deoxyxylulose phosphate reductoisomerase 

<400> 12 

Gly Ser Thr Gly Ser lie Gly 
1 5 



13 
10 
PRT 

Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: conserved 
motif 

<220> 

<221> PEPTIDE 
<222> (1) . . (10) 

<223> Conserved amino acid motif located at N terminus 
of ketol acid reductoisomerase wherein Xaa 
represents any amino acid 



<210> 
<211> 
<212> 
<213> 



<400> 13 

Gly Xaa Gly Xaa Xaa Gly Xaa Xaa Xaa Gly 
1 5 io 
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